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all claims. 

Status of Amendments 

Amendments made in Applicant's First Amendment After Final, mailed on December 27, 
2004, have been entered for purposes of appeal, and are reflected in the Claims Appendix 
attached hereto. 

Summary of Claimed Subject Matter 

The invention is a new process for constructing biologically active ligands with improved 
specificity and binding affinity for biological targets, by starting with a ligand that is known to 
bind a biological target, identifying a second ligand that binds to the biological target, and 
constructing a composite ligand from the two ligands in which the distance between the two 
ligands in the composite ligand, and the orientation of the ligands to one another in the 
composite ligand, closely mirrors their relative distance and orientation when bound to the 
substrate as separate ligands. 

In a principal embodiment, described in the specification as the "Direct Method" on 
pages 9:24-10:14, the process is carried out according to the following steps: 

1 . paramagnetically labeling a ligand that is known to bind to a biological target; 

2. preparing a complex between the paramagnetically labeled ligand and the biological 
target; 

3. preparing NMR spectra of the complex; 

4. modifying the complex by the addition of a second ligand that binds to the biological 
target; 

5. preparing NMR spectra of the modified complex; and 

6. analyzing the spectra to determine whether the second ligand has bound to the 
biological target within the paramagnetic zone of the label on the first ligand; 

7. further analyzing the spectra to determine the distance of separation between the first 
and second ligand when bound to the biological target; 

8. deducing the relative orientation of the first and second ligands when bound to the 
biological target; and 

9. linking the two ligands together at the distance of separation and orientation deduced 
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in steps 8 and 9. 

The combined ligands have a greater specificity and affinity for the biological target than either 
of the ligands by itself. 

Independent Claim 1 - Independent claim 1 covers the "Indirect Method" described 
above and in the specification at 9:24-10:14, but broadens it slightly to cover the several ways 
that the NMR spectra can be generated for the biological target and the second ligand. For 
example, the spectra can be generated from a complex formed between the biological target and 
the second ligand, and compared to the spectra from the complex formed between the biological 
target and the first ligand, or spectra can be generated from a tri-part complex formed between 
the biological target, the second ligand and the first ligand, and compared to the spectra from the 
complex formed between the biological target and the first ligand. Support for this expansion is 
contained in the specification at 10:26-10:30. 

Independent Claim 19 — Independent claim 19 covers a variation of the foregoing 
process, referred to in the specification as the "Direct Method," in which NMR spectra are also 
prepared of the second ligand and a paramagnetically labeled derivative of the first ligand in the 
absence of the biological target, and compared to spectra derived from a tri-part complex formed 
between the biological target, the second ligand and the first ligand, and spectra derived from a 
complex of the biological target and only one of the ligands, to deduce the relative distances of 
the ligands when bound to the biological target. This method is described in the specification at 
11:1-11:18. 

Grounds of Rejection to be Reviewed on Appeal 

The following issues are presented on appeal: 

(1) whether the preamble in claims 1 and 19 is unclear because it recites "a method for 
improving the binding affinity of ligand for a biological target," when the claim does not 
specifically recite an improvement in the binding affinity. 

(2) whether step (c) of claim 20 is rendered unclear by the presence of the term 
"substantially." 

(3) whether the claims are properly rejected as obvious over the combined teachings of 
Johnson et al. (1999) and Bolon et al. (1999). 
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Argument 

Claim Rejections - 35 U.S.C. § 112, second paragraph 

The Final Office Action rejects claims 1 and 19 because they recite a method for 
improving the binding affinity of a ligand for a target and do not provide a step of improving the 
binding affinity. This rejection is improper because (1) the objectionable limitation is found in 
the preamble, which does not even limit the claim, and (2) the binding affinity is inherently 
improved in a significant number of constructs whenever one carries out the recited sequence of 
steps. 

The Final Office Action rejects claim 20 because clause (c) contains the term 
"substantially." Applicant respectfully submits that a skilled worker would understand precisely 
what is meant by the term substantially, since it refers to a distance and orientation that improves 
the binding affinity of the resulting compound. 

Claim Rejections - 35 U.S.C. $ 103 

The Final Office Action also rejects the claims as obvious based on the combined 
teachings of Johnson (1999) and Bolon (1999). During a telephone interview conducted on 
March 15, 2005, the Examiner informed Applicant's attorney that she was relying on page 620, 
in column 2 of Johnson, which describes composite protein that Johnson constructed by linking 
together two cellulose binding domains (CBD N i and CBD N 2) of the CenC beta-l,4-glucanase 
enzyme. This linking of two protein domains is in contrast to the linking of two parts of a ligand 
that binds to a protein as described in the instant claims. Johnson was studying the composite 
protein with the two CBDs to gain some understanding of the binding characteristics of the 
CBDs to cellulose, but ultimately he concluded that the relative orientation of the CBD protein 
domains did not even matter, because "the tandem CBDs anchor CenC to its natural substrate . . . 
without a strong preference for their orientations." (P. 620 at col. 2.) 

Johnson did not link the CBD domains of the composite protein at a preferred orientation 
and distance to improve their binding affinity for a ligand, or undertake any steps to characterize 
that preferred distance and orientation, all as required by the claims of the present invention. He 
does not identify the structure of the linker, the distance between the two CBDs when bound to 
the linker, or the relative orientation of the two CBDs when bound to the linker. In fact, he states 
just the opposite in column 2 of page 620 when he states that "the spatial arrangement of these 
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modules (i.e. the two CBD ligands) within native C.fimi CenC remains to be defined." 
Continuing in the second column of page 620, Johnson further confirms the point when he states 
that the two ligands "cannot bind simultaneously to adjacent regions of a single polymer chain" 
due to "structural constraints" in the linker. 

Johnson never addressed the issue of a distance between a first and second ligand and 
never "determine[ed] whether a second ligand perturbs peaks on the second NMR spectra that 
are also perturbed by the paramagnetic label on the first NMR spectra," as required by clause (d) 
of claim 1 . That step would only be performed if one was interested in determining whether the 
two ligands bound to the target molecule at an ascertainable distance, and constructing a 
composite ligand at the distance ascertained. Moreover, no orientational data from dipolar 
couplings were collected as described in claims i.e. 7, 19(e) and 20(b). The "orientation" of the 
cellulose-derived ligand recited by Johnson actually refers to a "direction" of binding in the 
protein site which proves to be ambiguous, as opposed to a three dimensional orientation. This 
latter information was also dependent on use of assigned protein resonances, a step not needed in 
the present invention. 

To overcome these deficiencies, the Examiner makes reference to the fact that Johnson 
reports some spin label studies that he conducted in column 1 of page 620. These studies were 
performed to locate the binding site of a single ligand on the protein surface, not to define 
distances between two ligands; again it required assignment of protein resonances. Once again, 
this represents a fundamental difference between the method Johnson was employing and the 
method described in the instant claims. 

The Examiner also cites to Bolon (1999) to cure these deficiencies, but Bolon merely 
reports the availability of residual dipolar couplings to deduce the relative orientation of a ligand 
when bound to a biological target based upon observations of residual dipolar couplings. The 
article only analyzed the orientation of a single ligand relative to a protein structure. The authors 
did not bind two different ligands to the protein structure ~ as required by the pending claims — , 
deduce the relative orientations of the two ligands - as required by the pending claims — or 
determine the distance between the ligands - again as required by the pending claims, because he 
was not concerned with improving the binding affinity of a ligand for the biological target by 
making composite ligands. In short, the reference did not offer any suggestion or motivation for 
constructing composite ligands based upon their relative distances and orientations when bound 
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to a protein, as recited in the pending claim. Lacking such a suggestion or other motivation, the 
references do not support a prima facie case of obviousness. 



In view of the arguments presented herein, Applicants respectfully request that the final 
rejection in this matter be vacated, and that this application be returned to the examiner with 
instructions to enter a notice of allowance. 



December 20, 2005 

KING & SPALDING LLP 

191 Peachtree Street 

Atlanta, GA 30303 

(404) 572-4600 (Telephone) 

(404) 572-5145 (Facsimile) 

K&S Docket No. 04342. 105053 



CONCLUSION 



Respectfully submitted, 




Clark G. Sullivan 
Reg. No. 36,942 
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APPENDIX 1 
CLAIMS APPENDIX 

1) A method for improving the binding affinity of a ligand for a biological target comprising: 

a) preparing first NMR spectra of a first complex comprising the biological target and a 
paramagnetically labeled derivative of a first ligand to define a paramagnetic zone; 

b) preparing second NMR spectra of a second complex comprising the biological target 
and a second ligand; 

c) analyzing the spectra to determine whether the second ligand binds to the biological 
target within the paramagnetic zone of the paramagnetically labeled derivative; 

d) deducing a relative three-dimensional orientation of the first and second ligands when 
bound to the biological target; 

e) deducing a distance of separation of the first and second ligands when bound to the 
biological target; and 

f) selecting or preparing a compound that contains the first and second ligands 
substantially in the relative orientation and distance, wherein the binding affinity of 
the first ligand for the biological target is improved. 

2) The method of claim 1 wherein step (c) is performed by: 

a) identifying peaks on the first NMR spectra that are perturbed by the paramagnetic 
label; and 

b) determining whether the second ligand perturbs peaks on the second NMR spectra that 
are also perturbed by the paramagnetic label 

3) The method of claim 1 wherein the first complex further comprises the second ligand, and 
step (c) is performed by determining whether the paramagnetically labeled derivative of the 
first ligand perturbs peaks associated with the second ligand. 

4) The method of claim 3 further comprising, before step (c), preparing third NMR spectra of 
a mixture of the paramagnetically labeled derivative of the first ligand and the second 
ligand in the absence of the biological target. 

5) (CANCELLED) 
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6) The method of claim 1 wherein the distance of separation is determined as a function of the 
loss of intensity for NMR resonances from the second ligand. 

7) The method of claim 1 wherein the three dimensional orientation is deduced by producing 
a field ordered state in a medium comprising the biological target, the first ligand, and the 
second ligand, and analyzing dipolar couplings within the first and second ligands. 

8) The method of claim 7 wherein the field ordered state is produced by an aqueous dispersion 
of lipid bicelles having complementary charges to the biological target. 

9) The method of claim 7 wherein the field ordered state is produced by an aqueous dispersion 
of bacteriophage having a domain of the biological target in the outer coat. 

10) The method of claim 1 wherein the paramagnetic label is a nitroxide or metal chelate. 

11) The method of claim 1 wherein the first and second NMR spectra are two dimensional 
heteronuclear single quantum coherence spectra. 

12) The method of claim 1 wherein the biological target is isotopically labeled. 

13) The method of claim 1 wherein the biological target is a protein, and NMR resonances from 
the protein are not assigned to a sequence of the protein. 

14) The method of claim 1 wherein the biological target is a protein, and the three dimensional 
conformation of the protein is unknown. 

15) The method of claim 1 wherein the first ligand is an oligosaccharide, and the biological 
target is a protein. 

16) The method of claim 1 wherein the second complex comprises the first ligand, or a 
paramagnetically labeled derivative thereof. 

17) (CANCELLED) 

18) (CANCELLED) 

19) A method for improving the binding affinity of ligands for biological targets comprising: 

a) preparing first NMR spectra of a first complex comprising a biological target, a 
paramagnetically labeled derivative of a first ligand, and a second ligand; 

b) preparing second NMR spectra of a second complex comprising the biological target 
and either the second ligand or the paramagnetically labeled derivative of the first 
ligand; 

c) preparing third NMR spectra of a mixture of the paramagnetically labeled derivative 
of the first ligand and the second ligand in the absence of the biological target; 
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d) analyzing the spectra to determine whether the paramagnetically labeled derivative of 
the first ligand perturbs peaks associated with the second ligand; 

e) deducing a relative three-dimensional orientation of the first and second ligands when 
bound to the biological target; 

f) deducing a distance of separation of the first and second ligands when bound to the 
biological target; and 

g) selecting or preparing a compound that contains the first and second ligands 
substantially in the relative orientation and distance, wherein the binding affinity of 
the first ligand for the biological target is improved. 

20) The method of claim 19 further comprising: 

a) deducing from the NMR spectra the distance between the first and second ligands 
when bound to the biological target; 

b) deducing from the NMR spectra the relative three dimensional orientation of the first 
and second ligand when bound to the biological target; and 

c) selecting or preparing a hybrid ligand that contains the first and second ligands 
covalently linked substantially at the bond distance and relative orientation deduced in 
steps (a) and (b). 
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APPENDIX 2 
EVIDENCE APPENDIX 

Attached hereto are copies of the evidence entered by the Examiner and relied upon by 
appellant in the appeal. This evidence was cited by the Examiner at the following points during 
prosecution of this application: 

Johnson et al., J. Mol. Biol. (1999) 287:609-625, cited by the Examiner in Office Actions 
mailed February 25, 2004 and October 26, 2004. 

Bolon et al., J. Mol. Biol. (1999) 293:107-1 15, cited by the Examiner in an Office Action 
mailed October 26, 2004. 
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APPENDIX 3 
RELATED PROCEEDINGS APPENDIX 




CLAIMS APPENDIX 



APPENDIX 1 
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1) A method for improving the binding affinity of a ligand for a biological target comprising: 

a) preparing first NMR spectra of a first complex comprising the biological target and a 
paramagnetically labeled derivative of a first ligand to define a paramagnetic zone; 

b) preparing second NMR spectra of a second complex comprising the biological target 
and a second ligand; 

c) analyzing the spectra to determine whether the second ligand binds to the biological 
target within the paramagnetic zone of the paramagnetically labeled derivative; 

d) deducing a relative three-dimensional orientation of the first and second ligands when 
bound to the biological target; 

e) deducing a distance of separation of the first and second ligands when bound to the 
biological target; and 

f) selecting or preparing a compound that contains the first and second ligands 



2) The method of claim 1 wherein step (c) is performed by: 

a) identifying peaks on the first NMR spectra that are perturbed by the paramagnetic 
label; and 

b) determining whether the second ligand perturbs peaks on the second NMR spectra that 



3) The method of claim 1 wherein the first complex further comprises the second ligand, and 
step (c) is performed by determining whether the paramagnetically labeled derivative of the 
first ligand perturbs peaks associated with the second ligand. 

4) The method of claim 3 further comprising, before step (c), preparing third NMR spectra of 
a mixture of the paramagnetically labeled derivative of the first ligand and the second 
ligand in the absence of the biological target. 



substantially in the relative orientation and distance, wherein the binding affinity of 
the first ligand for the biological target is improved. 



are also perturbed by the paramagnetic label. 



5) (CANCELLED) 
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6) The method of claim 1 wherein the distance of separation is determined as a function of the 
loss of intensity for NMR resonances from the second ligand. 

7) The method of claim 1 wherein the three dimensional orientation is deduced by producing 
a field ordered state in a medium comprising the biological target, the first ligand, and the 
second ligand, and analyzing dipolar couplings within the first and second ligands. 

8) The method of claim 7 wherein the field ordered state is produced by an aqueous dispersion 
of lipid bicelles having complementary charges to the biological target. 

9) The method of claim 7 wherein the field ordered state is produced by an aqueous dispersion 
of bacteriophage having a domain of the biological target in the outer coat. 

10) The method of claim 1 wherein the paramagnetic label is a nitroxide or metal chelate. 

11) The method of claim 1 wherein the first and second NMR spectra are two dimensional 
heteronuclear single quantum coherence spectra. 

12) The method of claim 1 wherein the biological target is isotopically labeled. 

13) The method of claim 1 wherein the biological target is a protein, and NMR resonances from 
the protein are not assigned to a sequence of the protein. 

14) The method of claim 1 wherein the biological target is a protein, and the three dimensional 
conformation of the protein is unknown. 

15) The method of claim 1 wherein the first ligand is an oligosaccharide, and the biological 
target is a protein. 

16) The method of claim 1 wherein the second complex comprises the first ligand, or a 
paramagnetically labeled derivative thereof. 

17) (CANCELLED) 

18) (CANCELLED) 

19) A method for improving the binding affinity of ligands for biological targets comprising: 

a) preparing first NMR spectra of a first complex comprising a biological target, a 
paramagnetically labeled derivative of a first ligand, and a second ligand; 

b) preparing second NMR spectra of a second complex comprising the biological target 
and either the second ligand or the paramagnetically labeled derivative of the first 
ligand; 

c) preparing third NMR spectra of a mixture of the paramagnetically labeled derivative 
of the first ligand and the second ligand in the absence of the biological target; 
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d) analyzing the spectra to determine whether the paramagnetically labeled derivative of 
the first ligand perturbs peaks associated with the second ligand; 

e) deducing a relative three-dimensional orientation of the first and second ligands when 
bound to the biological target; 

f) deducing a distance of separation of the first and second ligands when bound to the 
biological target; and 

g) selecting or preparing a compound that contains the first and second ligands 
substantially in the relative orientation and distance, wherein the binding affinity of 
the first ligand for the biological target is improved. 

20) The method of claim 19 further comprising: 

a) deducing from the NMR spectra the distance between the first and second ligands 
when bound to the biological target; 

b) deducing from the NMR spectra the relative three dimensional orientation of the first 
and second ligand when bound to the biological target; and 

c) selecting or preparing a hybrid ligand that contains the first and second ligands 
covalently linked substantially at the bond distance and relative orientation deduced in 
steps (a) and (b). 
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APPENDIX 2 



EVIDENCE APPENDIX 



Attached hereto are copies of the evidence entered by the Examiner and relied upon by 
appellant in the appeal. This evidence was cited by the Examiner at the following points during 
prosecution of this application: 

Johnson et al., J. Mol. Biol. (1999) 287:609-625, cited by the Examiner in Office Actions 
mailed February 25, 2004 and October 26, 2004. 

Bolon et al., J. Mol. Biol. (1999) 293:107-1 15, cited by the Examiner in an Office Action 
mailed October 26, 2004. 
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The Cellulose-binding Domains from Cellulomonas 
fimi p-1,4-Glucanase CenC Bind Nitroxide Spin-labeled 
Cellooligosaccharides in Multiple Orientations 

Philip E. Johnson, Emmanuel Brun, Lloyd F. MacKenzie, 
Stephen G. Withers and Lawrence P. Mcintosh* 



Protein Engineering Network of 
Centres of Excellence and 
Department of Chemistry and 
Department of Biochemistry 
and Molecular Biology 
University of British Columbia 
Vancouver, British Columbia 
V6T 123, Canada 



Corresponding author 



The N-terminal cellulose-binding domains CBD N1 and CBD^ from 
Cellubmonas fimi cellulase CenC each adopt a jelly-roll p-sandwich struc- 
ture with a cleft into which amorphous cellulose and soluble cellooligo- 
saccharides bind. To determine the orientation of the sugar chain within 
these binding clefts, the association of TEMPO (2,2,6,6-tetamethylpiperi- 
dine-loxyl-4-yl) spin-labeled derivatives of cellotriose and cellotetraose 
with isolated CBD N1 and CBD^ was studied using heteronuclear l H- 15 N 
NMR spectroscopy. Quantitative binding measurements indicate that the 
TEMPO moiety does not significantly perturb the affinity of the celloo- 
ligo-saccharide derivatives for the CBDs. The paramagnetic enhance- 
ments of the amide 1 H N longitudinal (AR X ) and transverse (AR 2 ) 
relaxation rates were measured by comparing the effects of TEMPO-cello- 
tetraose in its nitroxide (oxidized) and hydroxylamine (reduced) forms 
on the two CBDs. The bound spin-label affects most significantly the 
relaxation rates of amides located at both ends of the sugar-binding cleft 
of each CBD. Similar results are observed with TEMPO-cellotriose bound 
to CBD N1 . This demonstrates that the TEMPO-labeled cellooligosacchar- 
ides, and by inference strands of amorphous cellulose, can associate with 
CBD N1 and CBD^ in either orientation across their 0-sheet binding clefts. 
The ratio of the association constants for binding in each of these two 
orientations is estimated to be within a factor of five to tenfold. This find- 
ing is consistent with the approximate symmetry of the hydrogen-bond- 
ing groups on both the cellooligosaccharides and the residues forming 
the binding clefts of the CenC CBDs. 

<; 1999 Academic Press 

Keywords: protein<arbohydrate interaction; NMR; spin label; 
glycosynthase 



Introduction 

The p-l,4-glucanase CenC from Cellulomonas fimi 
is modular in structure and function (Coutinho 
et al, 1992, 1991). Its central catalytic domain is 
connected by a short linker to two tandem N-term- 



inal cellulose-binding domains, CBD^ and 
CBD^CTomme et al, 1995). In contrast to most 
CBDs which bind crvstalline cellulose, the homolo- 
gous family IV CBD N1 and CBD^ from CenC are 
unique in their specificity for amorphous cellulose 
and soluble cellooligosaccharides. As part of a col- 



Present address: P. E. Johnson, Howard Hughes Medical Institute, Department of Chemistry and Biochemistry, 
University of Maryland Baltimore County, Baltimore, MD 21250, USA. 

Abbreviations used: CBD, cellulose-binding domain; CBD Ccx , the cellulose-binding domain from the mixed 
reUulase/xvlanase Cellulomonas fimi Cex; CBD^, the N-terminal cellulose-binding domain from Cellulomonas fimi 
M 4 elucanase CenC; CBD^, the cellulose-binding domain from Cellulomonas fimi p-l,4-glucanase CenC following 
CBDm in sequence; CBD N1N2 , the tandem cellulose-binding domains from Cellulomonas fimi |H,4-glucanase CenC; 
HSOC heteronuclear single quantum correlation; NOE, nuclear Overhauser effect; pH*, the observed pH meter 
reading without correction for isotope effects; TEMPO, 2^^tetramethvlpiperidine-lK)xyl-4-yl; TEMPO-GIC3, 
TEMPO-labeled cellotriose; TEMPO-Glc 4 , TEMPO-labeled cellotetraose. 
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laborative effort to exploit these domains for use in 
biotechnology, we have used NMR spectroscopy 
and calorimetry to provide a structural and 
thermodynamic foundation for understanding the 
distinct binding properties of CBD N1 and CBD^ 
(Creagh et aL, 1998; Johnson et aL, 1996a,b, 1998; 
Tomme et aL, 1996a). 

CBD N1 Qohnson et aL, 19%b) and CBD^ 
are composed of two five-stranded antiparallel 
(E.B., P.EJ. & L.P.M., unpublished results) 0-sheets 
that fold into very similar jelly-roll sandwich struc- 
tures. As demonstrated by NMR chemical shift 
perturbations and the observation of intermolecu- 
lar protein-sugar NOEs, a single cello- 
oligosaccharide molecule binds to these CBDs 
within a groove or cleft that runs across one 
P-sheet face of each protein domain (Johnson et aL, 
1996a,b). The presence of a clef t readily explains 
the binding specificity of these CBDs. Soluble eel- 
looligosaccharides and single polysaccharide 
chains, as might be encountered in the amorphous 
regions of cellulose, bind within this cleft, whereas 
flat crystalline arrays of cellulose are excluded. 
In agreement with the observation that the affinity 
of CBD N1 for cellooligosaccharides increases in 
the order cellotriose < cellotetraose < cellopentaose 
~ cellohexaose, the sugar binding sites in the CenC 
CBDs can each be spanned by approximately five 
glycosyl units (Johnson # a L, 1996a,b; Tomme et al., 
1996a). 

In parallel with these structural studies, a 
detailed calorimetric analysis of the association of 
CBD Ni with polysaccharides was also conducted 
(Creach et aL, 1998; Tomme et aL, 1996a). CBD N1 
was found to bind regenerated (phosphoric acid 
swollen) cellulose and other soluble (M,4-linked 
polymers of glucose, such as hydroxyethyl cellu- 
lose and barley and oat-p-glucan, with affinities 
equal to those measured for cellopentaose and 
cellohexaose. Formation of the CBD N1 -sugar com- 
plexes is favored enthalpically, indicating that 
hydrogen bonding and van der Waals interactions 
provide the primary driving force for the binding 
event. Inspection of the structures of CBD^! and 
CBD h42 reveals that their binding clefts consist of a 
central strip of hydrophobic side-chains, flanked 
on both sides by polar residues. This led us to 
propose that the pyranose rings of cellooligosacch- 
harides, and by inference single polysaccharide 



chains in regions of amorphous cellulose, are 
stacked: against the hydrophobic strip, while the 
flanking hydrophilic residues provide hydrogen 
bonds to the equatorial hydroxyl groups of the 
sugar (Johnson et aL, 1996a,b). Such interactions 
are often observed with carbohydrate-binding pro- 
teins (Quiocho, 1989, 1993; Vyas, 1991). 

The structures of CBD N1 and CBD^q were calcu- 
lated using NMR data collected for these protein 
domains in the presence of saturating concen- 
trations of cellotetraose or cellopentaose, 
respectively Qohnson et aL, 1996b). Numerous 
intermolecular nuclear Overhauser effects (NOEs) 
between the 13 C/ 15 N-labeled proteins and the 
unlabeled cellooligosaccharides were observed in 
isotope edited /filtered NOESY experiments 
(Otting & Wiithrich, 1990). However, the NMR 
spectra of the cellooligosaccharides are highly 
degenerate, thus preventing the unambiguous 
assignment of these NOEs to specific interactions 
between sugar and protein protons. This precluded 
the direct determination of the structures of the 
CBD-cellooligosaccharide complexes. 

As a first step towards achieving this goal, we 
have focussed on defining the orientation of a cel- 
lulose chain within the binding clefts of CBD N1 
and CBD N2 . Exploiting a novel glycosynthase tech- 
nology (Mackenzie et aL, 1998), we have prepared 
derivatives of cellotriose and cellotetraose with a 
2,2,6,6-tetramethylpiperidine>l-oxy-4-yl (TEMPO) 
spin-label covalently attached to the reducing end 
of the sugar (Figure 1). The use of such paramag- 
netic relaxation probes has a long history in the 
study of biological macromolecules by NMR spec- 
troscopy (Kosen, 1989). The nitroxide moiety cou- 
tains an unpaired electron that provides an 
efficient mechanism for the relaxation of neighbo r - 
ing nuclei via dipolar coupling. Due to the magni- 
tude of the electron magnetic moment, this 
interaction extends to over 20 A. In contrast, pro- 
ton-proton NOEs are limited to separations of less 
than -5 A. Thus, these TEMPO-labeled sugars can 
be utilized to obtain long-range distance infor- 
mation about the CBD-cellooligosaccharide com- 
plexes. 3n practice, the excellent dispersion of -ihe- 
signals ?from the amide *H- 15 N groups, combined 
with the high degree of sensitivity of the HSQC 
experiment (Cavanagh et aL, 1996), allows the 
effect of the spin-label cellooligosaccharides on 
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Figure 1. Chemical structures 
of the spin-labeled glycosides: (a) 
TEMPO-Glc 3 and (b) TEMPO-Glc 4 . 
Reduction of the paramagnetic 
nitroxide (NO) moiety produces 
the diamagnetic hydroxy la mine 
(N-OH) derivatives of. these 
glycosidt 



each non-proline residue in 15 N-labeled CenC 
CBDs to be evaluated. Similar studies using spin- 
labeled ligands and heteronuclear NMR methods 
have been reported by several groups (Gellespie & 
Shortle, 1997a,b; Zhao et al., 1997; Kleerekoper et aL, 
1995; Yu et a/., 1994). Using this approach, we 
demonstrate that the modified sugars bind to 
CBD N1 and CBD^ in both possible orientations 
across their (J-sheet binding clefts. This result is 
consistent with the approximate symmetry of the 
cellooligosaccharides and the positions of the side- 
chains forming the binding clefts of the two protein 
domains, as well as 15 N relaxation measurements 
which indicate that residues within the binding 
cleft of CBD N1 remain conformational^ flexible in 
the presence of these sugars (P.E.J., E.B. & L.RM, 
unpublished results). 

Results 

Preparation of TEMPO- 
labeled cellooligosaccharides 

Syntheses of the nitroxide spin-labeled cellotrio- 
side (TEMPO-Glc ? ) and cellotetraoside (TEMPO- 
Glc 4 ) were accomplished in a two-step chemo- 
enzymatic process, thereby avoiding many of the 
difficulties inherent in the preparation of oligosac- 
charide derivatives by chemical methods alone. 
A TEMPO-labeled glucoside was first synthesized 
via standard Koenigs-Knorr methodology, essen- 
tially according to a published procedure 
(Gnewuch & Sosnovsky, 1986; Plessas & Goldstein, 
1981). This product was characterized by mass 
and, to a limited extent, -H-NMR spectroscopy. 
Full NMR spectroscopic characterization was per- 
formed on the catalytically hydrogenated (reduced) 
and acetylated hydroxylamine derivative that no 
longer bears a free radical. Conversion of the 
TEMPO-glucoside to TEMPOGlc 3 and Glc 4 was 
accomplished by successive transfer of glucose 
residues from a-glucosyl fluoride using a mutant 
Agrobacterium sp. p-glucosidase (glycosynthase) in 
which the catalytic nudeophile, Glu358, was 
replaced by an alanine residue (Mackenzie et aL, 
1998). This approach exploits the well-known 
transglycosylation reaction catalyzed by retaining 
glycosidases, yet has the advantage that the 
mutant enzyme cannot hydrolyze the end pro- 
ducts, thereby resulting in good yields of -70% 
per coupling. The products were purified by HPLC 
and their identities confirmed by ionspray mass 
spectrometry, and after reduction and acetylation, 
by X H-NMR spectroscopy. 

Binding of TEMPO-labeled 
cellooligosaccharides to CBD N1 and CBD^ 

We have demonstrated previously, using NMR 
spectroscopy and isothermal titration calorimetry, 
that the monomeric CBD N1 binds soluble cellooli- 
gosaccharides with a stoichiometry of 1:1 (Johnson 
et aL, 1996a; Tomme et aL, 1996a). Similar results 
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Figure 2. The association constants of the CenC CBDs 
for TEMPO-labeled cellooligosaccharides were measured 
using l H- 15 N NMR spectroscopy. Shown are the nor- 
malized chemical shifts changes, or fraction bound, of 
the amide proton of Asn81 in CBDn! upon titration 
with TEMPOGlc 3 (filled diamonds) and TEMPOGlc 4 
(filled squares), and the nitrogen atom of Vall97 in 
CBD N2 upon addition of TEMPO-Glc 4 (open squares). 
The solid and broken lines represent the best fits of the 
data to the Langmuir isotherm describing the binding of 
one ligand to a single protein site on CSD^ or CBD^, 
respectively. The total chemical shift changes for Asn81 
and Vall97 were extrapolated to be 0.1 ppm ( X H N ) and 
0.76 ppm < l5 N), respectively. 



have been obtained with CBD^ (data not shown). 
Continuing this approach, the interactions of the 
TEMPO-labeled cellooligosaccharides with CBD NL 
and CBD^ vvere analyzed quantitatively by moni- 
toring the 1 H N and 15 N chemical shifts of the pro- 
teins upon titration with these soluble sugars. The 
spectral changes resulting from the addition of 
TEMPOGlc 3 and TEMPOGIC4 are similar to those 
observed with the corresponding unlabeled celloo- 
ligosaccharides, indicating that these ligands all 
bind in the regime of fast exchange on the chemical 
shift timescale to the same regions in CBD N1 and 
CBD N2 and with the same 1:1 stoichiometry (data 
not shown). Apparent or macroscopic equilibrium 
association constants were obtained by fitting the 
titration data to the Langmuir isotherm describing 
the binding of one ligand molecule to a single pro- 
tein sit£(Figure 2). 

TEMPO-Glc 3 binds to CBD N1 with an apparent 
K a of 690(±260) M* 1 . This is marginally higher 
than the value of 180(±60) M" 1 measured pre- 
viously for unlabeled cellotriose (Johnson et aL, 
1996a). -The apparent K a values of TEMPO-Glc 4 
for CBD N1 and CBD^ are 4200(±400) M" 1 and 
3600(±2200) M~\ respectively, while the associ- 
ation constants of CBD N1 and CBD N2 for unlabeled 
cellotetraose are 4200(±72Ctt M 1 Qohnson et :«., 
1996a) and 7(>00(±650) M' 1 (data not shown). The 
relatively lar^e errors in the K a values measured 
for the TEMPO-labeled cellooligosaccharides reflect 
the fact that the nitroxide efficiently broadens the 




signals from many amide groups within the bind- 
ing cleft to the point that they are no longer obser- 
vable. In general, it is these amide groups that 
show the largest chemical shift changes upon 
addition of sugar to the protein and would other- 
wise be best suited for fitting to the binding iso- 
therm equation. Within the error of the data, we 
conclude that the TEMPO moiety does not signifi- 
cantly perturb the affinity of the CenC CBDs for 
these cellooligosaccharides. By way of comparison, 
increases in affinity of ~10 to 20-fold result from 
the lengthening of cellotriose or cellotetraose by 
one additional glycosyl unit to form cellotetraose 
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or cellopentaose, respectively (Tomme et aL, 
1996a). 

TEMPO-labeled cellooligosaccharides bind ~ 
CBD N1 £nd CBDns in multiple orientations 

In addition to perturbing the chemical shifts of 
amide groups in CBD N1 and CBD^ the binding of 
TEMPO-Glc 3 and Glc 4 causes a decrease in the 
intensities of the signals from many of these 
groups due to paramagnetic relaxation enhance- 
ment This is readily seen by comparing the 1 H- l5 N 
HSQC spectra of two proteins complexed with the 
oxidized and reduced cellotetraose derivative 
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Fieure 3 Portions of the non-sensitivity enhanced a H- 15 N HSQC. spectra of (a), (b) CBDm! and (c), (d) CBD^ 
bound to (a), (c) oxidized or (b), (d) reduced TEMPO-Glc 4 . The finariigand-to-protein ratios of 22:1 and 1.4:1 corre- 
spond to -80% and 65% saturation of CBIXj, and CBD^, respectively. Peaks from amides that "disappear" because 
of paramagnetic relaxation due to the nearby unpaired electron in the rutroxide-labeled cellotetraose are identified. 
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(Figure 3). Qualitatively, it is clear that a number 
of resonances present in the spectra of proteins 
bound to the reduced hydroxylamine form of the 
sugar are "absent" in the spectra recorded with the 
oxidized nitroxide form, i.e. their intensities fall 
below a very low contour level. Thus, these amide 
groups must be close 11 A; vide infra) to the free 
electron in the nitroxide moiety. 

The amide protons whose resonances are absent 
in the 1 H- 15 N HSQC spectra of the complexes of 
TEMPOGlc 4 with CBD N1 and CBD^ can be classi- 
fied into two groups that are located on the oppo- 
site ends of the binding clefts of these proteins. (As 
shown in Figure 5, the binding cleft of each CBD 
lies across an antiparallel |J-sheet A formed by five 
strands, arranged as A1-A2-A5-A3-A4.) In the case 
of CBD N1 , the first group includes Vall7, Alal8, 
Tyrl9 and Gly20 from 0-strand Al, while the 
second group comprises Tyr65, Gly86 and Thr87 
from (J-strand A4 and the loop between A3 and 
A4. Similarly, for CBD^, the first group is formed 
by Serl66, Leu 167, Tyrl68, Glyl69 from p-strand 
Al, while the second group includes of Tyr234, 
Arg235 and Thr236 from (J-strand A4 and the pre- 
ceding loop. In contrast, peaks from amide protons 
in the central p-strand A5 of each CBD are present 
in the 1 H- 15 N HSQC spectra. Inspection of the ter- 
tiary structures of CBD N1 and CBD^ reveals that 
the outermost strands, Al and A4, are separated 
by ~20 A. The observation that the amide protons, 
which are most strongly perturbed by the nitroxide 
spin label, are located in both of these outer 
p-strands cannot be accounted for by the binding 
of TEMPOGlc 4 to the CBDs in a single orientation. 
The simplest explanation of this result is that 
TEMPO-Glc 4 associates with CBD N1 and CBD^ in 
at least two distinct orientations, such that the 
nitroxide moiety lies at either edge of their sugar 
binding clefts. 

The peaks from amide protons in p-strands Al 
and A4 that are absent in the spectrum of CBD N1 
bound to TEMPO-Glc 4 also disappear upon bind- 
ing TEMPO-Glc 3 (see Supplementary Material). In 
addition, signals from several amide protons, 
including those of Thr21 in strand Al and Ala41, 
Gln42 £ , Tyr43, Val48, Asn50, and Gly51 in strand 
A2, are weak in the spectrum of CBD N1 complexed 
with TEMPOGlc 4 and absent in the presence of 
TEMPO-Glc 3 . This qualitative distinction may 
reflect a slight difference in the positions of the 
two sugars when bound to CBD N1 . Several pro- 
tons, including the indole H el of Trpl6 and the 
amide H N of Leul46, which are not in the binding 
cleft of CBD N i, are also affected by TEMPO-Glc v 
This is attributed to non-specific effects arising 
from the addition of a 22-fold molar excess of the 
nitroxide-labeled sugar to the protein sample. 

Measurement of A/?, and &r\ 
relaxation enhancements 

A qualitative inspection of the X H- 15 N HSQC 
spectra of the two CBD-TEMPO-Glc 4 complexes 



provides immediate insights into the binding 
orientations of the sugars. Further details regarding 
the positions of the bound nitroxide moiety can be 
gained by a quantitative analysis of the effects of 
the spin label on the relaxation properties of the 
two CBDs. To reduce the possibility of non-specific 
interactions, CBD N1 and CBD^q were studied at 
saturation levels of approximately 80% and 65%, 
respectively (Figure 2). 

The enhancements of the longitudinal (AR : ) and 
transverse (Al?^ relaxation rates of CBD N1 and 
CBDm2 due to complexation with TEMPO-Glc 4 are 
presented in Figures 4 and 5. Data for CBD N1 
bound to TEMPO-Glc 3 are provided as Supplemen- 
tary Material. The AR a values were determined 
from the differences in the T x lifetimes, measured 
using a l H- l5 N HSQC sequence as a readout 
of a non-selective inversion-recovery sequence 
(Cavanagh et aL, 1996), for the proteins in the pre- 
sence of the nitroxide and hydroxylamine deriva- 
tives of cellotetraose. The AR 2 values were 
obtained using two complementary methods. In 
the first, the effect of the spin label on the proton 
transverse relaxation (AH 2 ,voi) °' ^ during 
the INEPT and reverse-INEPT period of the HSQC 
pulse sequence was measured by comparing the 
total volume of its cross-peak in spectra recorded 
with the oxidized and reduced sugar. In the 
second^ Ai? 2 .Lw was determined from the change 
in the proton line-width of the amide measured 
with the two forms of the modified cellotetraose. 
The methods for determining AR X and AR 2 assume 
that contributions to the peak volumes and line- 
widths due to scalar couplings and relaxation pro- 
cesses other than that mediated by the free radical 
remain constant, and that the oxidized and 
reduced glycosides interact identically with the 
CBDs. The lack of any significant amide l H N or 
15 N chemical shift perturbation upon reduction or 
the TEMPO group provides strong support tor 
these assumptions (Figure 3). 

Following these approaches, 101 AR V 64 
&R->\ 0 \' an <* 99 Ai? 2 ,Lw values were obtained for 
CBD Nl and 113 AR^ f 44 &R 2yoi , and 118 Aii'^ w- 
values for CBD^ (Figure 4). Rate enhancements 
were not measurable for the remaining non-prO- 
line residues for two reasons, besides spectral 
overlap. First, in the case of those amide protons 
that are closest to the nitroxide moiety, severe 
line-broadening rendered their signals undetect- 
able in the spectra recorded with oxidized 
TEMPOGlc 4 . For these residues, we estimate 
AR] > 2i5 s"* and AR 2 > 80 s" ! . Second, in the 
case uf those amide protons furthest trom the 
nitroxide group, changes in their relaxation rates 
were top small to measure reliably. In particular, 
dilution effects due to the addition of ascorbic 
acid and subsequent pH adjustments caused the 
volumes of the peaks in the X H- 15 N HSQC spec- 
tra of the proteins recorded with the reduced 
sugar to decrease slightly. This led to small 
apparent negative AjR Z voi values for many 
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Figure 5. Paramagnetic relaxation studies demonstrate that TEMPCMJlc 4 binds to each CenC CBD in at least two 
orientations such mat the nitroxide moiety can lie at either end of the binding cleft. Shown are ribbon diagrams of 
CBDmi (left) and CBDm 2 (right) positioned for the reader to look across (top) and down onto (bottom) the cellooligo- 
saccharide binding groove that is formed by p-sheet A in each protein. The individual strands of mis ^-sheet are 
labeled. The ribbons are ramp-colored based upon AR 2XVJ values ranging from 1 Hz (light yellow) to 80 Hz (dark 
orange). Residues whose l Hr signals disappear completely in the presence of TEMPO-Glc 4 are indicated in red 
(AK lw > 80 Hz). The proline residues and other residues for which, no results were obtained are shown in white. 
The^reen and blue spheres indicate representative positions of theiitroxide groups calculated using an ensemble- 
averaging protocol in X-PLOR with measured r rff as distance restrains. Note that the TEMPO-Glc 4 can occupy oniy 
one of these two orientations at any given time. 



amides whose relaxation rates were not perturbed 
significantly by the spin label. 

As seen in Figure 4, Ai? 1 and AR 2 show similar 
trends, yet the latter provide a more sensitive indi- 
cator of the effects of the spin-labeled sugars on 
CBD N1 and CBD^. In the cases of amide protons 
with resolved l H^N HSQC cross-peaks that 
experience moderate paramagnetic relaxation 
enhancements, the values of Al?2,voi and Ai? 2 , L w 
agree well (ratio = 1.7(±1)). However, a more com- 
plete data set could be obtained by measuring 
changes in line-width rather than volume due to 
the above mentioned problems associated with 
dilution effects and difficulties encountered with 
partially overlapping peaks. 

Based on these measurements, it is immediately 
apparent that amide groups in four regions of 
CBD N1 and four regions of CBD^ experience 
significant paramagnetic relaxation enhancements 
due to the bound TEMPOGlc 4 . These include 



GlylS to Thr21, Tyr43 to AsnSl, Gln80 to Asj?90 
and Glul22 to Serl33 in CBD^ and Glyl60^6 
Glul72, Trpl92 to Asnl99, Glu229 to Glu239 and 
Gly266 to Ala279 in CBD N2 . These residues lie 
within the five (i-strands, Al to A5, and adjacent 
loops that form the cellulose-binding clefts of 
each protein. Of these four regions, amide pro- 
teins in strands Al and A4 that comprise the 
outer edges of the binding face have markedly 
larger AR X and AR-> values than do those in the 
central strand A5. this verifies that TEMPO-Glc 4 
binds CBD N1 and CBD^ in multiple orientations 
such that the nitroxide spin label is located at 
either end of the binding cleft. In contrast, the 
relaxation rates of amide groups located on the 
opposite side of each protein relative to this cleft 
(p-strands Bl to B5) do not exhibit marked 
enhancements. Similar patterns are observed with 
TEMPOGlc 3 bound to CBD N1 (Supplementary 
Material). 
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Calculation of electron-proton distances 

Values of the correlation time for the fluctu- 
ation of the electronic-nuclear dipole-dipole inter- 
action, were calculated for individual residues in 
both CBD N1 and CBD^ bound to TEMPO-Glc 4 
using the measured AJ? 1 and AR 2 Vo i or AR 2 ,lw 
rate enhancements. In the case of CBD N1 , t c values 
for 81 amide protons ranged from 0.1 ns to 3.6 ns, 
with a mean of 1.7(±05) ns. For CBD^, these ran- 
ged from 0.1 ns to 6.5 ns for 101 amide protons, 
with a mean of 1.9(±0.8) ns (Figure 6). The corre- 
lation time t c is dependent upon both the relax- 
ation of the electron and motions of the electron- 
proton vector (1/Tc = 1/t s + 1/tr). In this equation, 
x s is the longitudinal relaxation time of tiie free 
electron and x R is the effective rotational corre- 
lation time of the electron-proton vector. The latter 
is dependent on the motional characteristics of the 
protein-ligand complex. Since t r is in the range ot 
10 8 to 10 9 s for most proteins studied by solution 
NMR methods, and x s is typically longer than 10 " 7 
s for nitroxide radicals, x c is essentially equal to x R 
(Kosen, 1989). The correlation times for the global 
tumbling of CBD N1 and CBD^ saturated with eel- 
lopentaose are approximately 7.4 ns, as determined 
by 15 N relaxation methods (P.E.J., E.B. & L.P.M., 
unpublished results). The shorter correlation times 
extracted from the paramagnetic relaxation 
enhancements may reflect conformational mobility 
of the bound TEMPO moiety. The large degree of 
variability in the x c values measured for the amide 
protons in CBD N1 and CBD^ is attributed to com- 



pounding errors from the AR r and &R 2 measure- 
ments, as well as different orientation-dependent 
effects of internal mobility on the distance between 
the free electron and the 1 H N of individual amide 
protons. 

Using the x c values measured for each residue, 
the effective electron-proton distances, r eff , between 
the TEMPOGlc 4 nitroxide and the individual 
amide protons in CBD^ and CBD^ were calcu- 
lated. A similar analysis of the data for the 
TEMPO-Glc 3 was not performed due to the occur- 
rence of non-specific relaxation enhancement. The 
most complete set of relaxation data was obtained 
from linewidth measurements, and thus the dis- 
tances reported in Figure 6 were determined using 
the AK 2 , L w values; however, similar results were 
derived from the AR 2 ,voi rate enhancements. Based 
on the data in Figure 6, we find that the measured 
values of r eil range from 11 to 23 A. Consistent 
with the prev ious discussion of the data presented 
in Figures 2 and 3, the amide protons in CBD Ni 
and CBD^ closest to the nitroxide group are 
located -in P-strands Al and A4 on the binding 
faces otthe proteins, whereas those that are furth- 
est form strands Bl to B5. 



Structure calculations 

The positions of the nitroxide moiety in TEMPO- 
Glc 4 bound to CBD N1 and CBD,^ were calculated 
using the measured r eff as distance restraints for a 
simple minimization routine in X-PLOR (Briinger, 



CBD 



CBD 





10 - 




e- 




6- 


(ns) 




4- 




2- 




o- 




30- 




24- 




18- 


r efl 


(A) 


12 - 




6" 




o- 



A1. Bl B2 A2 




T 

60 SO 100 
Residue Number 



144 160 



200 220 240 
Residue Number 



260 280 



Figure 6. The correlation times, x c , and effective electron-proton distances, r^, calculated for (a), (b) CBD N1 and (e), 
(d) CBDte bound to TEMPO-Glc 4 . Residues whose signals disappear completely due to severe linebroadening are 
shown with a shaded bar at an upper bounds of = 11 A. The locations of the P-strands forming sheets A and B in 
the two proteins are indicated by the open and filled arrows, respectively. 
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1992). To account for multiple binding orientations, 
we used an ensemble-averaging protocol in which 
r eH = Zr" 6 ]" 176 where r is the electron-proton 
distance for each of n conformations, assumed to 
be equally populated (Bonvin & Briinger, 19%). 
A total of 60 minimization runs were performed 
using CBD N i or CBD^ in a system with n = 1, 2, 
or 3 nitroxide groups, modeled as "atoms" of mass 
30 amu and a. van der Waals radii of 2.5 A. When 
only one nitroxide group is utilized, the "atom" is 
always located in the middle of the binding cleft, 
approximately above (J-strand A5. However, this 
solution poorly satisfies the experimental restraints 
requiring the spin label to be closer to amide pro- 
tons on the edges of the binding cleft than to those 
near its center. When an ensemble of two nitroxide 
atoms is considered, we find that in 31/60 and 60/ 
60 cases for CBD N1 and CBD N2/ respectively, one 
atom is localized to each edge of the binding cleft 
(Figure 5). In the remaining 29/60 cases for CBD N1 , 
one atom is found near the middle of the binding 
cleft and the other distant from the protein. The 
lowest energies clearly occur in the former situ- 
ation, with the nitroxide group near (3-strands Al 
and A4. The difference between the results 
obtained for the two proteins is attributed to the 
exact choice of the bounds used for the distance 
restraints. When an ensemble of three nitroxide 
atoms is generated, 24/60 and 42/60 cases for 
CBD N1 and CBD^, respectively, have one atom at 
each edge of the binding cleft and the third distant 
from the protein. In the remaining 36/60 and 18/ 
60 cases, the nitroxide groups occupy a variety of 
positions, close to and distant from the proteins. 

Based on these calculations, we conclude that 
the simplest explanation for the observed patterns 
of paramagnetic relaxation enhancement is that 
TEMPO-Glc 4 binds both CBD N1 and CBD^ in two 
possible orientations such that the nitroxide moiety 
lies at either end of the binding cleft (Figure 5). In 
support of these results, the separation of ~23 A 
between the positions calculated for the nitroxide 
atoms matches approximately the length of a 
TEMPO-Glc 4 molecule. 

Discussion 

Binding affinity for spin- 
labeled cellooligosaccharides 

CBD N1 and CBD^ bind TEMPOGlc 3 and 
TEMPO-Glc 4 with affinities that are approximately 
equal to those measured previously for the unmo- 
dified sugars. Although the six-membered ring of 
TEMPO resembles in some ways that of a gluco- 
pyranosyl unit, it differs due to the presence of 
methvl substituents and the absence of equatorial 
hvdroxyl groups. As discussed by Tomme et ai 
(1996a), calorimetric studies have revealed that the 
binding of soluble cellooligosaccharides to CBD N1 
is driven by enthalpically favorable interactions 
such as hydrogen bonding and van der Waals con- 
tacts. We therefore conclude that the TEMPO 



group does not interact significantly with CBD N1 
or CBDj^ and is positioned in or near the binding 
cleft solelv by virtue of its covalent bonding to the 
cellooligosaccharides. This statement is supported 
by the observation that the chemical shift changes 
accompanying the binding of the modified and 
unmodified cdlooligosaccharides to the two pro- 
teins are similar. We also conclude that the pre- 
sence of an anomeric hydroxyl group does n$t 
contribute to the binding of cellooligosaccharides 
by CBD Ni or CBD^. This is not unexpected given 
that cellulose, the natural ligand for these proteins, 
exists in a highly polymeric form. 

Multiple binding orientations 

Qualitative and quantitative analyses of the 
effects of TEMPO-Glc, and TEMPO-Glc 4 on the 
relaxation properties of CBD N1 and CBD^ clearly 
demonstrate that the labeled cellooligosaccharides 
are bound in at least two orientations. These orien- 
tations position the nitroxide moiety at either edge 
of the binding cleft of each CBD, such that amide 
protons in P-strands Al and A4, as well as in the 
loop between A3 and A4, are perturbed by the 
spin label to a greater extent that those in the cen- 
tral strand A5 (Figure 5). Since the CBDs form 1:1 
complexes with the cellooligosaccharides, these 
orientations are mutually exclusive such that in 
any individual CBD complex, the sugar is bound 
in one direction or the other. Given that the 
TEMPCMabeled sugars and the corresponding 
unmodified cellooligosaccharides bind to the CenC 
CBDs with similar affinities and produce similar 
chemical shift perturbations, we conclude Gl%r- 
Glc 4 , and by inference, amorphous cellulose and 
other soluble p-giucans, are also bound in multiple 
orientations. 

Binding of the unmodified and TEMPO-labeled 
cellooligosaccharides occurs on an NMR time-scale 
of fast exchange as evident by the progressive 
change in the chemical shirts of CBD N1 and CBD N2 
upon the addition of these sugars. Therefore, the 
chemical shift of an amide proton at any point in 
the titration reflects a population-weighted average 
of its chemical shifts in the free and all-bound 
forms of the CBD. As a result, the equilibrium 
association constants determined by NMR and 
calorimetric methods represent apparent or macro- 
scopic fC a values that are the summation of the 
microscopic binding constants for each possible 
orientation of the sugar-protein complex (Wyman 
& Gill, 1990). 

An important question to ask is whether a pre- 
ference for one binding orientation exists. During 
the titrations of CBD N1 and CBD^ with the nitrox- 
ide-labeled sugars, it was noted that the signals 
from residues on p-strand Al disappeared at lower 
levels of saturation than did those on A4 (Johnson, 
1998). This could indicate that the binding of 
TEMPO-Glc 4 in an orientation with the nitroxide 
moiety -near P-strand Al is slightly favored, or 
simply .that the nitroxide is positioned closer to the 
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amides in this strand relative to those in A4. If we 
assume that the TEMPO labeled sugars bind in fast 
exchange between two possible orientations, then 
the measured r ef( is given by: 
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Here r x and r 2 are the distances from an amide pro- 
ton to the two positions of the free electron, and f bl 
and f b2 are the fractions of total protein bound by 
the ligand in each orientation. Using the positions 
of the nitroxide atoms shown in Figure 5, we calcu- 
late that varying / bl from 0.1 to 0.9 does not change 
the predicted r eff for amide protons near the edges 
of the CBD N1 and CBD^ binding clefts by more 
±20%. In other words, within experimental error, 
the effects of any small population differences 
between the spin-labeled ligands bound in one or 
the other orientation cannot be distinguished due 
to the sixth-power dependence of the relaxation 
enhancement on the proton-electron separation. 
This is readily seen by considering a case for 
which r x = 10 A and r 2 = 20 A. The calculated r eff 
is 10.2, 11.2, and 14.4 A for / bl of 0.9, 0.5, and 
0.1, respectively. A similar conclusion was made 
by Bonvin & Briinger (1995, 1996) when they 
demonstrated that NOE-derived distances are not 
sufficient to determine the fractional occupancies 
of the individual conformations of a protein that 
exhibits conformational heterogeneity. Because of 
this limitation, we conservatively estimate that the 
relative association constants for the binding of 
TEMPOGlc 4 to CBD N1 or CBD^ in the two poss- 
ible orientations are within a factor of approxi- 
mately five- to tenfold. Note also that if r a = r 2 and 
/bi=/b2> then saturation of the CBD at 65% 
(/ bl = 0.325) and 80% (/ bl = 0.4) leads to an r eff 
that is overestimated by only ~7% and 4%, 
respectively. 

Extrapolating from these studies, it is reasonable 
to suggest that TEMPO-Glc 3 and TEMPOGlc 4 may 
be bound in multiple conformations that have one 
or the other of these two general orientations, yet 
also differing in the exact position (or register) of 
the sugar with the p-sheet clefts of CBD N1 and 
CBDm2. Evidence for this hypothesis stems from 
two observations. First, the longer r eH values 
(>15 A) measured in this study, were consistently 
shorter than the electron-amide proton distances 
predicted from the calculated positions of the nitr- 
oxide atoms shown in Figure 5. Bearing in mind 
the significant errors associated with the measure- 
ment of these distances (see below), this could 
reflect a small population of ligand shifted to pos- 
ition the nitroxide near the center of the binding 
clefts of the CBDs and thus transiently close to 
amide protons in strand A5 and p-sheet B. Second, 
as noted previously Qohnson et aL, 1996a), the *H N 
and chemical shifts of CBD N1 and CBD^ are 
perturbed similarly upon binding of Glc 3 , Glc 4 , 
Glc 5 , and Glc*. This suggests that the four cellooli- 
gosaccharides interact structurally with the CBDs 



in the same manner, possibly by sliding within 
their binding clefts. 

Structural symmetry can explain the multiple - 
binding orientations _ 

An explanation for the binding of cellooligosac* 
charides by CBDnj and CBD^ in multiple orien- 
tations is provided by inspection of the structures 
of both the ligands and the protein molecules. 
Hydrogen bond formation between the oligosac- 
charide and the protein provides an enthalpic driv- 
ing force for binding (Tomme et aL, 1996a), and 
thus the locations of polar atoms within the mol- 
ecules should be considered. As shown in Figure 7, 
the hydroxyl and hydroxymethyl groups of cellote- 
traose are approximately symmetrically disposed 
such that comparable hydrogen bonds could be 
formed-to the CBDs whether the reducing end of 
the cdlooligosaccharide is located closer to 
p-strand Al or A4. In addition, as illustrated in 
Figure 8, the polar, non-polar, and aromatic side- 
chains implicated in cdlooligosaccharide recog- 
nition (Johnson et aL, 1996a,b) are approximately 
symmetrically positioned about the centers of the 
binding clefts of CBD N1 and CBD^. This provides 
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Figure Cellotetraose has approximate symmetry 
such that its hydroxyl and hydroxymethyl groups 
would occupy similar positions when the sugar chain is 
bound in either orientation by CBDmi and CBD N2 . 
(a), (b) Cellotetraose (Gepler et a/., 1994) with its redu- 
cing end positioned at the bottom or top of the Figure, 
respectively. The molecules are related by successive 
180 c rotations about axes perpendicular and parallel to 
the long axis of the cellooligosaccharide. 
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Figure 8. The residues implicated in cellulose recognition by the CenC CBDs are positioned approximately symme- 
trically about the centers of their binding clefts. This may facilitate the binding of a cellooligosaccharide chain in 
either orientation. Shown are ribbon diagrams of CBD NJ (left) and CBD^ (right) positioned for the reader to look 
across (top) and down onto (bottom) the cellooligosaccharide binding cleft that is formed by p-sheet A in each pro- 
tein. The side<hains of selected residues within the cellulose binding clefts are color coded as blue, polar; red, hydro- 
phobic; and yellow, aromatic. 



further opportunity for the complex formation of 
the ceUooligosaccharides in multiple orientations, 
while maintaining similar patterns of hydrogen 
bonds to the equatorial hydroxyl groups and 
van der Waals contacts to the glucopyranose 
rings. 

It is unlikely that all the oxygen atoms in cellote- 
traose and the polar atoms in the binding clefts of 
CBD N1 or CBDfSE axe involved simultaneously in 
intermolecular hydrogen bonding. The association 
of cellotetraose with CBD N1 involves a AG 0 of 
-4.9 kcal mol" 1 at 35 °C (Tomme et al, 1996a). If 
the sole contribution to this binding free energy is 
assumed to be hydrogen bonding involving the 14 
sugar hydroxyl groups, each would account for 
only -0.35 kcal mol ■ . Thermodynamic studies of 
several protein-carbohydrate complexes have 
demonstrated that the presence of some hydroxyl 
groups is essential for binding, whereas others are 
unnecessary or even a hindrance. The net contri- 
butions of individual hydroxyl groups to binding 
generally range from —05 kcal mol" 1 to —2 kcal 
mol" 1 , yet can be as high -6 kcal mol 1 (Lemieux, 
19%; Quiocho, 1993; Street et aL, 1986). Based on 



these values, CBDnx likely forms between 2.5 and 
ten hydrogen bonds to cellotetraose. Given the 
large number of potential hydrogen-bonding sites 
on this sugar and on the side-chains lining the 
binding clefts of CBD N1 and CBD^, it is reason- 
able to^suggest that ceUooligosaccharides, as *w©H 
as amorphous cellulose, could bind these proteins 
in many possible conformations with similar free 
energies. These conformations include opposite 
orientations of the sugar chain across the p-sheet 
faces of- the CBDs, as well as possible multiple reg- 
isters within their binding clefts. 

Further insights into the binding promiscuity of 
the CenC CBDs are also gained from dynamic 
studies of the proteins and protein-sugar com- 
plexes using and 2 H relaxation measure- 
ments. As will be reported elsewhere, the 
backbone amide protons and methyl-containing 
side-chains of CBDnx show motional disorder on 
a nano- to picosecond time-scale that is not 
restricted upon binding cellopentaose. This 
internal mobility or flexibility may also allow the 
cellooligosaccharides to bind in multiple, rapidly 
interconverting conformations. 
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Use of spin labels for obtaining ligand-protein 
distance restraints 

Nitr™ ;H<a - lahf *M rpllpoligosarrhariHpg rlparly 

provide a pow erful method for characterizing pro 
teih-carbohydrat^ interactions, qualitatively, the 
effects of the tree radical on the NMR spectra of 
CBD N1 and CBD^ yielded immediate evidence for 
the binding of TEMPO-Glc 3 and Glc 4 in multiple 
orientations. Although it was also possible to cal- 
culate the effective positions of the nitroxide moi- 
ety bound in a minimum of two possible 
orientations on the CBDs from a quantitative anal- 
ysis of the measured paramagnetic relaxation 
enhancement, the accuracy of these models is lim- 
ited for numerous reasons. These include: (i) com- 
plications associated with multiple conformations 
and partial saturation, as discussed above; (ii) 
the uncharacterized conformational mobility of the 
bound TEMPO group; (iii) inaccuracies in the 
NMR-derived structures of CBD N1 and CBD,^ cal- 
culated without inclusion of the bound Glc 4 or 
Glc 5 , respectively (Johns 011 et oi., 1996b); (iv) errors, 
which tend to lead to an underestimation of longer 
values of r eff , arising from difficulties in measuring 
small values of AR\ and AR 2 , and from the possi- 
bility of relaxation enhancement due to non- 
specific binding or to collisions between CBD- 
TEMPOGlc 4 complexes in solution; and (y) |ho _ 
lark ^nf distance restraints with r &H < 11 A resulting 
fro m the efficiency of paramagnetic relaxation. The 
1*npr problem could be addressed bv studyin g 
CBd^ and CBD^ partially titrated with small 
qua ritttte5~0f TEMFO-GlC 4 or wTEh mixtures of the 
labelecTsu gar in its oxidiied and reduced forms i n 
ordgr To~ identity the amides closest to the bound 
ni troxi dgJI 

To"understand in detail the mechanism of cellu- 
lose binding by the two Family IV CBDs, we are 
currently exploiting glycosynthases to prepare 
selectively 13 C-labeled cellooligosaccharides. These 
compounds will be utilized for isotope-edited and 
filtered experiments aimed at resolving and assign- 
ing intermolecular NOEs between the CBDs and 
the bound sugars. Using a combination of short- 
range distance restraints derived from interproton 
NOE interactions and long-range restraints from 
electron-proton paramagnetic relaxation enhance- 
ments, we hope to define more accurate structural 
models of the ensembles of the complexes of 
CBD N1 and CBD^ with cellooligosaccharides. 

Biological implications 

Studies of numerous cellulases and hemicellu- 
lases indicate that CBDs facilitate hydrolysis by 
passively increasing local enzyme concentrations 
on the surface of cellulosic and hemicellulosic sub- 
strates and, possibly, by actively disrupting non- 
covalent interactions between the chains of these 
polysaccharides (Tomme et ah, 1995). The speci- 
ficity of CBDs for the various ailomorphs of cellu- 
lose (e.g. crystalline or amorphous) will also serve 



to target these enzymes to distinct regions of this 
complex substrate. 

The results presented here demonstrate clearly 
that in isolation, CBD N1 and CBD^q each bind sol- 
uble cellooligosaccharides in multiple orientations. 
This immediately prompts the question as to the 
role played by these CBDs within their native con- 
text. CenC is a 1069 residue fM,4-glucanase, coniK 
posed of CBD N1 and CBD^ arranged in tandem ax 
its N terminus, a central family 9 catalytic domai^ 
and two additional domains of unknown functiok 
at its C .terminus (Coutinho et ah, 1992, 1991). Thk 
enzyme cleaves a variety of cellulosic substrates . 
and, based on an analysis of its hydrolysis of car- 
boxymethyl-cellulose, is semi-processive with both 
endo- and exoglucanase activities(Tomme et ah, 
1996b). Although the structures of the isolated 
family IV* binding domains (CBD N1 and CBD^) 
and the homologous Family 9 catalytic domains 
from Clostridium thermocellum celD (Juy et ah, 1992) 
and Tliernwnionospora fusca E4 (Sakon et a/., 1997) 
have been determined, the spatial arrangement of 
these modules within native C. fimi CenC remains 
to be defined. 

CBD N1 and CBD^ are juxtaposed without an 
intervening linker. Given that the affinity of the 
each domain for cellotetraose is equivalent whether 
joined or in isolation (unpublished), it is likely that 
CBD N 

inc can also bind oligomeric sugars in mul- 
tiple orientations. This theory is reasonable, as the 
binding- clefts of the two homologous CBDs both 
span approximately five glycosyl units and thus 
each domain in CBD N1N2 can independently 
accommodate a single cellooligosaccharide mol- 
ecule. Preliminary studies indicate that the affinity 
pf CBD^ 1N2 for phosphoric acid swollen cellulose 
is only approximately 2 fold greater than that of 
CBD N1 for this polymeric substrate (Tomme et ah, 
1996a). This implies that, when joined in tandem, 
these CBDs bind amorphous cellulose in an addi- 
tive, rather than co-operative, manner. A simple 
interpretation of this result is that the two domains 
comprising CBD N1N2 are structurally constrained, 
due to the lack of a flexible linker, such that they 
cannot bind simultaneously to adjacent regioris*of' 
a single polymer chain. We therefore speculate that 
the tandem CBDs anchor CenC to its natural sub- 
strate by bridging chains of amorphous cellulose 
without a strong preference for their orientations. 
Note however, as discussed above, we can only 
estimate that the relative association constants for 
the binding of TEMPO-Glc 4 to CBE^ or CBD^ in 
the two possible orientations are within a factor of 
approximately five- to tenfold. Thus in combi- 
nation, CBD N1N2 may in fact exhibit a significant 
overall preference for binding strands of amor- 
phous cellulose positioned to match the yet 
unknown orientation of the two domains within 
this tandem CBD. Resolution of this important 
question hinges upon the determination of the 
structure of CBD N1N2 , as well as further character- 
ization of naturally occurring amorphous cellulose 
(Bayer et ah, 1998). 
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CBD N1N2 is connected to the catalytic domain of 
CenC by a short proline-rich polypeptide linker 
sequence (Coutinho et al., 1991, 1992). In contrast 
to the isolated CBDs, it is certain that this catalytic 
domain has a definite orientational requirement for 
the binding and hydrolysis of cellulosic substrates. 
Although the exact boundaries and thus length of 
the linker sequence is unknown, the observation 
that it is highly sensitive to proteolytic cleavage 
(Tomme et al, 1996b) suggests that the linker 
affords at least some degree of flexibility between 
the binding and catalytic domains of CenC. This 
may allow the cleavage of properly oriented chains 
of amorphous cellulose other than those to which 
the CBDs are bound. Processivity would result 
from the diffusional sliding of both the binding 
and catalytic domains along the substrate cellulose 
chains. Enzymatic studies of CenC with and with- 
out CBD N1N2 , as well as structural characterization 
of the catalytic domain, are clearly necessary to 
understand the role of these Family IV CBDs in 
modulating the activity of this C.fimi p-l,4-gluca- 
nase. 

Although over 180 putative CBDs in 13 families 
have been identified, little is known about the 
detailed mechanism of cellulose and hemicellulose 
binding by these protein domains. Studies of sev- 
eral family 2 CBDs, such as those from C. fxmi Cex, 
have demonstrated that binding to crystalline cel- 
lulose is mediated by conserved aromatic residues 
aligned along a flat face of the molecule (Xu et al, 

1995) . Calorimetric measurements have proven 
that binding is entropically driven(Creagh et al, 

1996) , indicative of hydrophobic stacking of the 
aromatic side-chains on the glucose rings. Hydro- 
dynamic studies have also revealed that CBD Cex 
diffuses readily across the surface of crystalline cel- 
lulose Qervis et al, 1997). We therefore speculate 
that CBD Cex/ as well as members of other CBD 
families, also binds in multiple orientations to cel- 
lulose. In contrast, based on the crystal structure of 
a fragment of the endo/exocellulase E4 containing 
a family 3c CBD and a family 9 catalytic domain, 
Sakon et al. (1*997) have hypothesized that the bind- 
ing domain aids in the activity of this enzyme by 
helping to feed a cellulose chain, bound in a 
defined orientation, into the active site of the cata- 
lytic domain. This distinctive behaviour may result 
from the intimate structural association of the two 
domains in the T. fusca endo/exocellulase. 

In summary, we have shown that CBD N1 and 
CBD^ from C. fimi CenC bind TEMPOlabeled eel- 
looligosaccharides in multiple orientations. This 
promiscuous mode of sugar recognition can be 
attributed to the approximate symmetry of the 
hydrogen bonding groups on cellulose and on the 
side-chains lining the binding clefts of these two 
CBDs. Further structural studies of both native 
CenC and its substrate, amorphous cellulose, are 
necessary to understand the mechanisms by which 
this and other cellulolytic enzymes function in the 
efficient degradation of biomass. 



Materials and Methods 

Synthesis of TEMPO-labeled celloollgosaccharldes : . 

Details of the synthesis of TEMPOGlc 3 and TEMPO- 
G\c 4 will be provided elsewhere. Ionspray mass spec- 
troscopy of the final products yielded 658 (M+), 659 
(M + 1) and 660 (M 4- 2) for TEMPO-Glc3 (calculated for 
C^H^NO^: 658.7) and 820 (M+), 821 (M + 1), and 822 
(M + 2) for TEMPO-GIC4 (calculated for C^H^NO^: 
820.8). Useful 'H-NMR data could not be obtained for 
these products due to the presence of the spin label 
Structural characterization was therefore achieved by 
reduction of the nitroxide group to a hydroxy la mine and 
acerylatipn of the entire molecule. The spectral data for 
these derivatives of TEMPOGlc 3 and TEMPO-GIq, 
respectively, are provided below. 

2,2,6,6-Tetramethytpiperidin- Uacetoxyf-4-yi 2,3,4,6-tetra- 
0-acetyt$-D-glucopyranosyl-(1^> 4)-2,3,6-triO -acetyl- 
fi'D-glucopyranosyt-(1-+ 4)-2 f 3,6-triO-acetyl-$- 
D-glucopyranoside 

A H NMR (500 MHz, CDC1 3 ) 6: 5.14 (dd, 1 H, ]i A 
9.6 Hz, H-3), 5.13-5.09 (m, 2 H, H-3' H-3"), 5.05 (dd, 1 H, 
/ 4 - r 9.5 Hz, H-4"), 4.89 (dd, 1 H, ] T2r 8.8 Hz, ) T r 
8.1 Hz, H-2"), 4.85 (dd, 1 H//„ 9.5 Hz, J r r , 8.7 Hz, H- 
2'), 4.83 (dd, 1 H, 9.6, Hz / 2 , 7.3 Hz, H-2), 4.60-4.47 
(m, 4 H, H-l H-l' H-l" H6a), 4.42 (dd, 1 H, 
11.7 Hz, J,* . y 1.5 Hz, H-6a'), 434 (dd, 1 H, / 6a W - 
12.4 Hz,^ -jsr 4.4 Hz, H-6a"), 4.14 (dd, 1 H, 4.5 Hz, 
H-6b ), 4.07 (dd, 1 H, J M 5.9 Hz, H-6b), 4.04 (dd, 1 tf, 
1.6 Hz, H-6b"), 3.93 (m, 1 H, H-l'"), 3.76 (dd, 1 6, 
/ 4 . s 9.4«z, H-4'), 3.71 (dd, lH,/ 45 9.5 Hz, H-4), 3.65 
(m, 1 H, H-5"), 3.63-358 (m, 2 H, H-5 H-5'), 2.14-1.98 (33 
H, 11 x Ac), 1.83-138 (m, 4 H, H-2"' H-3'"), 1.13 (6 % 
CH 3 ), 1.10 (6 H, CH 3 ). MS (Ionspray) 1123 (M + 1); cal- 
culated for C49H71NQ28 (1122.1). 

2,2,6,6-Tetramethylpiperidin- 1 -acetoxyl-4-yl 2,3,4,6-tetra- 
0-acetyh$-D-glucopyranosyl-(1-+ 4)- 2,3,6-triO-acetyl- 
$-o-gluc§pyranosyl-( f-* 4)- 2,3,6-tri-0-acetyl-$- 
D-glucopyranosyt -(1^ 4)- 2 t 3,6-triO-acetyl-$- 
D-glucopyranoside 

l H NMR (400 MHz, CDC1 3 ) 6: 5.15 (dd, 1 H, } X4 
9.5 Hz, H-3), 5.12-5.08 (m, 3 H, H-3' H-3" H-3'"), 5.04 
(dd, 1 9.5 Hz, H-4'"), 4.89 (dd, 1 H, J 2 ^ 8.9 Hz,. 

/ 2 ~ r 8.0 -Hz, H-2"'), 4.86- 4.81 (m, 3 H, H-2 H-2' H^T, 
4.6(M.47 <m, 5 H, H-l H-l' H-l" H-l'" H6a>, 4.60-432 
<m, 3 H, H-6a H-6a" H-6a"'), 4.15-1.00 (m, 4 H, H-6b H- 
6b' H-6b" H-6b ") 3.95 (m, 1 H, H-l""), 3.81-3.68 *(m, 3 H, 
H-4 H-4" H-4""), 3.66 (m, 1 H, H-5"'), 3.62-3.57 (m, 3*ft, 
H-5 H-5' H-5"), 2.15-1.96 (42 H, 14 x Ac), 1.85-1.37 (m, 4 
H, H-2"" H-3""), 1.15-1.10 (12 H, 4 x CH 3 ). Ionspray MS 
141 1 (M + 1 ); calculated for C^H^NO* (1410.34). 

Protein samples j: j 

The l5 N-labeled CBD N1 (residues 1-152 of CenC) and 
CBD N2 (residues 146-296) were produced by expression 
of the plasmids pTugNl and pfugN2 in Escherichia cali 
JM101 cells grown in minimum media containing 1 g/1 
of 15 NH 4 C1 and 1 g/1 of 99% 15 N-labeled Isogro (Isotec 
Inc.; Johnson et al., 1996a). The secreted proteins were 
purified from the culture supernatant and the periplas- 
mic fraction using both affinity chromatography on cel- 
lulose (Avicel) and ion-exchange chromatography, as 
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described (Johnson et al., 1996a). Samples of CBO^ and 
CBDn2 were exchanged into a final buffer of 50 mM 
sodium chloride, 50 mM sodium 2 H 3 -acetate, 3 to 5 mM 
CaCl 2 , and 0.02% sodium azide in 10% 2 H 2 O/90% H 2 0 
at pH* 6.1 using a microsep concentration device (Fil- 
tron). CBD N1 binds calcium with an equilibrium associ- 
ation constant of MO 5 M* 1 at pH 6.0 and thus exists as a 
1:1 CBDMi-Ca"* -2 complex under these conditions 
Qohnson et al., 1998). The metal ion stabilizes the folded 
structure of CBDmj but does not alter its affinity for cel- 
looligosaccharides. In contrast, CBD^ does not bind cal- 
cium appreciably. 



completion of the data collection with either CBDni or 
CBD N2 bound to the paramagnetic cellooligosaccharide, 
the nitroxide functionality was reduced to the diamag- 
netic hydroxylamine by the addition of two molar 
equivalents of solid L-ascorbic acid (Sigma). The pH* jbf- 
the sample was re-adjusted to 6.1 and the spectra necessr 
ary to measure ARj and AR 2 were recorded. All spectra 
were acquired as 1024 x 96 complex points with spectral 
widths of 6500 and 1450 Hz in the *H and 15 N dimen- 
sions, respectively. Selective water flip back pulses were 
utilized to ensure minimum perturbation of the water 
magnetization (Zhang et al., 1994; Grzesiek & Bax, 1993). 



Titrations with TEMPO-labeled cellooligosaccharides 

The binding of the TEMPO-labeled cellooligosacchar- 
ides to the CenC CBDs at pH* 6.1 and 35 °C was 
measured quantitatively using l H- 15 N HSQC spec- 
troscopy. Stock solutions of TEMPO-Glc 3 and TEMPO 
Glc 4 were prepared by weight in the above buffer and 
added in small aliquots to the protein samples. The 
titration of CBD N1 with TEMPOGlc 3 was carried out by 
progressively adding the labeled sugar to 0.32 mM pro- 
tein up to a final ligand-to-protein ratio of 22:1. The 
titrations of 0.65 mM CBD N] and 0.48 mM CBD^ with 
TEMPO-Glc 4 were taken to final corrected ligand-to-pro- 
tein ratios of 22:1 and 1.4:1, respectively. Protein concen- 
trations were measured using t m ) = 21370 M" 1 cm" 1 
and 20500 M" 1 cm" 1 for CBD N1 and CBD^ respectively 
(Johnson et al., 1996a). Equilibrium association binding 
constants were determined by non-linear least-squares 
fitting of the J H N and 15 N chemical shifts of numerous 
amide protons versus cellooligosaccharide concentration 
to the Langmuir isotherm describing the binding of one 
ligand molecule to a single protein site(Johnson et aL 
1996a). These amide protons, which include Gly7, GlylS, 
Val34, Gly44, Thr65, Gln80, Asn81, Thr87 and Glyl30 for 
CBD N , and Vall97, Tyrl98, Gly231, Tyr234 and Ala237 
for CBD N2 , exhibit fast exchange between the free and 
bound states on the chemical shift time-scale and are 
detectable, albeit with diminishing intensity, over the 
course of the entire titration. In the cases of the TEMPO- 
Glc 4 titrations, good fits of the experimental data points 
to the binding isotherms were obtained only if scaling 
factors of -13 (for CBD N2 ) or 1.8 (for CBDnj) were intro- 
duced to correct for apparent errors in measuring the 
weight of the sugar (M mg) during the preparation of 
stock titrant solutions. The reported binding constants 
and errors are the averages and standard deviations of 
the individual values measured for each 1 H N and 15 N 
nuclei. 



NMR spectroscopy 

Spectra were acquired on a Varian Unity 500 MHz 
spectrometer at 35 °C. Each point of the titration of the 
^-labeled proteins with TEMPO-Glc 3 or Glc 4 was mon- 
itored by the acquisition of a sensitivity-enhanced gradi- 
ent 1 H-' 5 N HSQC spectrum (Kay et al., 1992). Upon 
formation of the sugar-protein complexes, i H N T } values 
were measured using a sensitivity-enhanced gradient 
'H-^N HSQC sequence as a read-out of a non-selective 
inversion-recovery experiment recorded with delays of 
f = 0, 0.1, 0.2, 0.4, 0.8 and 2 seconds. The f = 0.1 spec- 
trum was repeated to help estimate experimental error. 
A non-sensitivity-enhanced gradient ] H- i5 N HSQC spec- 
trum was then recorded to obtain the spin-spin relax- 
ation enhancements, AR^ of the amide 1 H N nuclei. After 



Calculation of A/?, and Aflj 

NMR spectra were analyzed using a combination of 
Felix (Biosym Technologies) and NMRPipe (Delaglio 
et al., 1995). Data for the *H Tj series were processed 
with mild Lorentzian-to-Gaussian apodization. The rela- 
tive peak volumes, V t , at each inversion-recovery delay t 
were fit ito the function V, = V n exp(— t/T x ), as described 
by Farrow et al. (1994). V n is the volume at time t — 0, 
and errors in the measured T x lifetimes were estimated 
using Monte Carlo simulations. The paramagnetic 
enhancement of each amide proton spin-lattice relaxation 
rate, A/fj, was calculated as: 
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Ox and red indicate the data collected with the labeled 
cellooligosaccharides in the nitroxide and hydroxylamine 
forms, respectively. 

The non-sensitivity enhanced *H- 15 N HSQC spectra 
were processed without apodization in the proton 
dimension and with a mild exponential linebroadenirtg 
window" in the nitrogen dimension. Two methods were 
used to measure individual values of AR 2 . In the first, 
the ratio -of the peak volumes recorded for an amide in 
the presence of the nitroxide (V ox ) versus the hydroxy la- 
mine-labeled ( V^j) cellooligosaccharide is given by: 
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where f — 1 0.1 ms is the total time during the INEPT 
and reverse-INEPT components of the HSQC pulse 
sequence in which j H N magnetization is in the transverse 
plane akd thus subject to paramagnetic relaxatiojju 
A non-gradient enhanced HSQC was utilized to avoid . 
possible complications due to the use of two reverse- 
INEPT sequences. By measuring total peak volumes, and , 
not heights, the effect of differential T 2 relaxation during 
the detection periods of the spectra recorded in the pre- 
sence of the oxidized and reduced sugars can be neg- 
lected. Peak volumes were determined by direct 
integration over a manually adjusted "footprint" and 
not by fitting of an idealized peak shape. Equation © 
can be rearranged to obtain AR 2 . vo i : 

In the second method, AR 2 xw were obtained from the 
change in the linewidths of tine A H- 15 N cross-peaks in the 
proton dimension: 

AR 2 .lw = ~{fl) t J = nLWm " LWred) (4> 
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where LW TO and LVV^ are the full half-height line- 
widths in the presence of the nitroxide and hydroxyla- 
mine-labeled cellooligosaccharides, respectively. The 
line- widths were measured by non-linear least-squares 
fitting of vectors, extracted in the proton dimension 
through a given cross-peak, to the equation for a single 
Lorentzian peak split by a 3 /hn-Ho coupling using the 
program PLOTDATA (TRIUMPH, UBC, Vancouver, 
Canada). The coupling constant was either treated as an 
independent variable or fixed at the value measured pre- 
viously (Johnson et al., 1996b) using the HNHA or 
HMQC-J experiments (Bax et al., 1994; Kay & Bax, 1990). 
Significant errors in the measurement of AR 2 .lw arise if 
this coupling is ignored. The effects of the free electron 
on the relaxation rates of the amide nitrogens are neg- 
lected due to the small magnetogyric ratio of the 15 N 
nucleus relative to that of 1 H. 



Calculation of 

The magnetic interaction of an unpaired electron 
and a proton is described by the modified Solomon- 
Bloembergen equations (Solonon & Bloembergen, 
1956).The enhancement of the longitudinal (AR^ and 
transverse (AR 2 ) relaxation rates of a proton due to the 
spin label are given by the equations (Kosen, 1989; 
Gellespie & Shortle, 1997a): 



ARz = A (^) = ^( 4tc 
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where K is the constant 1.23 x 10" 32 cm 6 s" 2 for a nitrox- 
ide radical, r is the distance between the electron and 
proton, x c is the correlation time for the fluctuation of the 
electronic-nuclear dipole-dipole interaction, and u>h is 
the Larmor frequency of the proton. These equations are 
based on the assumptions that the vector between the 
electron and proton is free to undergo isotropic 
rotational diffusion, and that the distance r is constant. 
The value of t c for each individual amide can be deter- 
mined directly from the ratio of AR 2 to ARi by combin- 
ing equations (1) and (2) to obtain: 



Kith 



1/2 



(7) 



With a value of x c in hand, the distance between the pro- 
ton and electron, r, is calculated by rearranging equation 

(2): 



"A ( 4Tc + a+ix?)) 



(8) 



Individual values of x e# rather than a global average, 
were utilized to reflect directly the experimentally 
measured relaxation enhancements for each amide. For 
an ensemble of in tercon verting protein-ligand com- 
plexes, the measured distances will reflect the weighted- 
average of the individual electron-proton distances and 
thus are denoted as r rff . 

Standard deviations for AK„ AR 2 , t c and r rff were cal- 
culated by conventional error propagation (Bevington & 
Robinson, 1992). 



Structure modeling 

The positions of the nitroxide moiety in TEMPOGlc 4 
bound to CBDn! or CBD^ were estimated using ; ; a 
simple restrained minimization protocol provided with 
X-PLOR r v.3.8 (Briinger, 1992). The only attractive force 
was the soft-square potential of the distance restraint* 
energy term (£ NO e) and the only repulsive force was that 
corresponding to van der Waals interactions (£ vdw ). The 
bounds on the effective electron-proton distances, r M , 
were defined for three cases. First, for eight amide pro- 
tons in CBDn, and eight in CBD^ for which AR 2 values 
could not be obtained due to severe paramagnetic line- 
broadening (thus r rff < 11 A) , the distance bounds were 
set to the range of 1-11 A. Second, for 27 amide protons 
in CBDisn and 26 in CBD^ for which the measured r rff 
were 11 to 15 A, the distances were restrained to 
r rff ± 3 A. Third, for 54 amides in CBD N1 and 59 in 
CBD N2 with r^, > 15 A, the distance bounds were set 
from 13 A to an arbitrarily high value of 45 A. Although 
distances between 11 and 23 A were measured for 
CBDkh and CBDn 2 (Figure 6), it was found that those 
>15 A were consistently underestimated in light of the 
dimensions of these proteins. Therefore, values greats 
er than 15 A were used only to derive lower bounds to 
restrain the nitroxide atoms from approaching the corre- 
sponding amide protons in the two protein structures. 

To account for multiple positions of TEMPOGlc 4 
bound to CBDjsjj or CBDm 2 , ensemble-averaging (Bon v in 
& Briinger, 1996) was applied to coordinates systems 
containing one, two, or three nitroxides, defined as single 
atoms with a mass of 30 amu and a van der Waal radius 
of 23 A. 1 The co-ordinates of the protein (pdb file 1UIX5 
for CBDn, and a low-energy structure for CBD^ 
(unpublished)) were held fixed, and each ensemble of 
nitroxide groups was minimized 60 times starting from 
random positions. The quality of the calculated struts 
hires were estimated by the NOE energy penalty due jb 
distance restraint violations. 
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The geometric relationships between ligands and the functional groups 
that bind ligands in soluble ligand-protein complexes have traditionally 
been deduced from distance constraints between pairs of NMR active 
nuclei spanning the ligand-protein interface. Frequently, the steep inverse 
distance dependence of the nuclear Overhauser effect (NOE), from which 
the distance constraints are derived, makes identification of sufficient 
numbers of constraints difficult. In these cases the ability to supplement 
NOE-derived information with distance-independent angular information 
can be very important. Here, the observation of residual dipolar coup- 
lings from ot-methyl mannose bound to mannose binding-protein in a 
dilute liquid crystalline medium has allowed the deterrnination of a 
bound ligand's average orientation. The 3-fold rotational symmetry of 
mannose-binding protein defines its orientational tensor and obviates the 
need to determine experimentally the protein's average orientation. 
Through superimposition of ligand and protein orientational tensors we 
describe the binding geometry of ot-methyl mannose bound to mannose- 
binding protein. This new method is of general applicability to the study 
of ligands bound to proteins, and it is of particular interest when neither 
X-ray crystallography nor NOE techniques can provide sufficient infor- 
mation to describe binding geometries. 

© 1999 Academic Press 

Keywords: dipolar couplings; NMR; structure detennination; carbohydrate 
interactions; mannose-binding protein 



Introduction 

We illustrate here a new approach for the deter- 
mination of ligand geometry in protein binding 
sites that relies on orientational constraints derived 
from residual dipolar couplings observed in NMR 
spectra of partially ordered macromolecular com- 
plexes (Bothner-By, 1995; Prestegard, 1998; 
Prestegard et al, 1998; Tjandra & Bax, 1997; 
Tolman et al, 1995). The determination of the 
bound geometry of protein ligands is an important 
issue for improving our understanding of how 
structure relates to biological function, and also for 
building a basis for rational drug design. Many 
useful approaches to this determination have been 

Abbreviations used: NOE, nuclear Overhauser effect; 
MBP, mannose-binding protein; CRD, carbohydrate 
recognition domain; AMM, ot-methyl mannoside; HSQC, 
heteronuclear single quantum coherence. 

E-mail address of the corresponding author: 
jpresteg@ccrc.uga.edu 



developed in the past, including X-ray diffraction 
of crystalline protein-ligand complexes and NMR- 
based transfer NOE studies of complexes in sol- 
ution (Ni, 1994). All the approaches are limited in 
certain ways. In the case of X-ray crystallography, 
complexes must be crystallizable; in the case of 
transferred nuclear Overhauser effect (NOE) stu- 
dies, ligands must be in rapid exchange. Transfer 
NOE studies, while giving sound information on 
ligand geometry, also suffer from a lack of infor- 
mation on the nature of ligand protein contacts. 
Information on these contacts can be provided by 
more conventional NOE studies using both protein 
and ligand resonances in some cases, but this infor- 
mation is often difficult to obtain because it 
requires assignment of protein as well as ligand 
resonances, and because it presumes sufficiently 
close approach of protein and ligand protons to 
allow efficient magnetization transfer (r < 5 A). 

Cases where detennination of bound ligand geo- 
metry have been particularly difficult include those 
involving protein-oligosaccharide interactions. 
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These cases encompass an important area in that 
protein-oligosaccharide interactions mediate a 
variety of processes essential to cell function and 
survival. To name a few, sialyl Lewis* ohgosac- 
charide-selectin interactions mediate the recruit- 
ment of leucocytes to sites of injury ^. The 
interaction of choferatoxin with the ohgosacchande 
headgroup of ganglioside CM, mediates internakz- 
afcxTof toe toxfc subunit, and p^^™* 
galectin interactions are hypothesized to be 
fi^o^l in the metastasis of malignant cells 
^1992V Characterization by traditional NMR 
'^^^olikosaccr^ interactions 
^tH«processes is difficult because the 
^widing networks involving hydroxyl 
S^^iugars are often part of the interface 
^w^^protein wid oligosaccharide. The hydroxyl 
^SS^then the ones in closest proximity to 
K^jprotons, but these exchange rapidly with 
Tfi ^bulk water making their NMR reson- 
ances hard to observe. In these difficult cases, 
il' dipolar data can offer a valuable alterna- 
^wiltfhere ^he data can in principle constrain both 
Jw^tteand geometry and ligand orientation 
iS^ tb the protein binding site. 
l^^^&rSaJar target in the studies presented 
&'^wm^mihng protein-A (MBP). MBP is 
a protein important in innate immune response. It 
binds sugars, including a-mannosides, that are fre- 
quently exposed on the surface of invading patho- 
>~: gens, and subsequently triggers a complement 
^ cascade (Ikeda et al, 1987). Improper activation via 
p5P:^MBP has also been suggested as a factor in inflam- 
matory disease (Dwek, 1996). Crystal structures of 
several forms of MBP are available. The form we 
study here is a 149 residue truncated version that 
includes a caldum<ontaining carbohydrate recog- 
nition domain (CRD) and a short ot-helical stem 
that initiates the formation of a homo-trimer by 
making a left-handed coiled<oil with helical stems 
from two other MBP monomers. The 3-fold sym- 
metry axis in the resulting homo-trimeric structure 
is important for the interpretation of our results. 
The binding of several ligands to MBP and mutant 
MBPs that mimic selectin proteins has also been 
studied, both because of interest in MBP's ligand 
binding properties and those of proteins which 
show homology to the MBP CRD, such as the 
selectins. Crystallography has been the primary 
contributor in studies of MBP trimers, with struc- 
tures existing for bound galactoside and galactosa- 
mine, as well as sialyl Lewis-X and related ligands 
(Chang et al, 1994; Iobst et al, 1994; Kolatkar AR, 
1998; Ng & Weis, 1997; Weis & Drickamer, 1994). 
Additionally, there are structures of a shorter frag- 
ment of MBP which makes a homo-dimer; one of 
these includes a-methyl mannoside in the carbo- 
hydrate binding site (Ng et al, 1996). The results of 
the above crystallography studies provide a useful 
basis for evaluating data from our new approach. 

The new approach that we present is based on 
the observation of residual dipolar coupling of 
directly bonded l H - 13 C pairs in a-methyl manno- 
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side (AMM) when the mannoside is bound to MBP 
in a field-oriented aqueous liquid crystal. The use 
of residual dipolar coupling in the characterization 
of the structures of proteins and other macromol- 
ecules in solution has received a good deal of 
attention recently (Clore ft al, 1998a; Tjandra it 
Bax, 1997) . When molecules are partially ordered 
in a magnetic field interaction, vectors connecting 
magnetic nuclei in the molecules .depart from a 
complete isotropic sarnplir^ 
space. This gives rise to a m 
of NMR resonances ^t^iependatMl^i^mMe 
properties 6f the Mm^^mS^^S^^^i 
separation (r) and ' ^ri^m(^^m^mcior S 
makes with the rr*gr*l^ 
are directly bonded as te Y l H^^Hl^|ilS; 
the 1/r 3 distance-dependence of the interaction can 
be regarded as being determined by the bond 
length and the primary variable becomes the orien- 
tation of the bond relative to the magnetic field as 
characterized by the angle 0. The functional depen- 
dence of the dipolar couplings is (1/2(3 cos 2 
0 — 1)), where the parentheses denote an average 
over an orientation distribution, and the resulting 
interaction appears as an addition to the normal 
one bond scalar couplings. These through space 
interactions are always present; in fact, they are the 
basis of the common NOE interaction. However, 
the angular function above is averaged to zero 
when space is isotropically sampled and direct 
splittings of resonances only appear in partially 
oriented media. Fortunately, several means of pro- 
ducing field-ordered states are now available, 
including inherent orientation due to the large ani- 
sotropics in the magnetic susceptibilities of some 
molecules (Kung et al, 1995; Tolman et al, 1995), 
orientation due to the interaction of molecules with 
lipid bicelles that form field-oriented liquid crystals 
(Tjandra & Bax, 1997) and interaction "with filamen- 
tous bacteriophage that also form field-oriented 
liquid crystals (Clore et al, 1998c; Hansen et al, 
1998). 

The interpretation of residual dipolar data has 
taken two routes, incorporation of individual bond 
constraints as penalty functions in simulated 
annealing protocols for molecular structure deter- 
mination (Clore et al., 1998a,b), and extraction of 
order tensors that directly describe the direction 
and level of ordering forces from the point of view 
of a coordinate frame fixed in a rigid molecular 
fragment (Losonczi et al, 1999; Losonczi & 
Prestegard, 1998b; Saupe, 1968). The latter 
approach which arose from substantial previous 
work in the liquid crystal field, proves to be par- 
ticularly valuable for the problem at hand. An 
order tensor is a 3 x 3 matrix with elements 
(1/2(3 cos 9, cos 9j f - 5 /; )) written in terms of direc- 
tion cosines that relate Cartesian axes of an arbitra- 
rily chosen molecular axis system to the magnetic 
field direction- Because the matrix is traceless and 
symmetric, there are only five independent 
elements. An order matrix in an arbitrarily chosen 
frame of a molecular fragment is difficult to inter- 
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pret, but if diagonal ized, the five independent vari- 
ables turn into a principal order parameter, an 
asymmetry parameter and three Euler angles relat- 
ing the diagonal, or principal frame, to the original 
molecular frame. If fragments of interest experience 
a common source of order, as they would if they 
were parts of a rigid ligand-protein complex, the 
directions and levels of the orienting force should 
appear the same from the point of view of each 
fragment when a model for the fragments is 
assembled with proper fragment orientations. This 
provides a test for the validity of proposed models 
for the orientation of a ligand (one fragment) in a 
protein binding site (another fragment). 

The key to using this test is having an approxi- 
mately rigid fragment with enough potential dipo- 
lar coupled pairs to determine the five elements of 
an order tensor. For a 15 N-labeled protein, pairs of 
^-^N spins are clearly abundant and much of the 
protein backbone can be considered rigid. In oligo- 
saccharides, individual pyranose rings frequently 
prefer a *Ci configuration which can be considered 
to have approximately rigid geometry and ^-"C 
pairs are abundant In a-methyl mannoside there 
are five directly bonded ^-^XT pairs on the 
ring, two of which point in two unique directions 
and three of which depart slightly from a third 
direction. These pairs can provide the data for an 
order tensor determination. There are a number of 
ways of accurately determining the splittings of 
resonances from these pairs (Tolman & Prestegard, 
1996) but the easiest is simply to collect *H- 13 C het- 
eronuclear single quantum coherence (HSQC) spec- 



tra, without decoupling in one of the frequency 
domains. This is the approach we use here. We 
used uniformly 13 C-labeled a-methyl mannoside in 
order to improve sensitivity. 

Results and Discussion 

Measurement of residual dipolar couplings in 
AMM complexed to MBP 

The l H- l Kl HSQC NMR spectra of ,3 C^enriched 
AMM in the presence of an equimolar amount of 
MBP dissolved in a dilute bicelle medium are 
shown in Figures 1(a) and (b) at temperatures of 
25 °C and 39 °C, respectively. The data were 
acquired in the absence of l3 C decoupling during 
acquisition, and the one bond 13 C- l H couplings are 
shown as frequency domain splittings in the pro- 
ton dimension. The magnitudes of these couplings 
and the corresponding errors extracted using a 
Bayesian parameter estimation method (Andrec & 
Prestegard, 1998) are shown on the spectra. As one 
would expect for the case of isotropic tumbling at 
25 °C, all the couplings in Figure 1(a) are similar 
and fall in the range of typical Vch scalar coupling 
constants for non-anomeric sites in carbohydrates 
(140-150 Hz). On the other hand, couplings 
measured at 39 °C are more widely spread because 
of the presence of a residual dipolar contribution. 
Residual dipolar contributions for individual C-H 
bond vectors calculated from the differences in 
couplings measured at 25 °C (isotropic) and 39 °C 
(aligned) are shown in Table 1 (D^. 
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Figure 1. (a) Section of a proton-coupled HSQC spectrum of isotropic AMM in MBP with selected couplings anno- 
tated, (b) Section of a proton-coupled HSQC spectrum of oriented AMM in MBP with selected couplings annotated. 
The anomeric regions of the spectra are not shown but show splittings of 169.4(±0.5) Hz for isotropic condition and 
158.6(±0.2) Hz for the oriented case. 
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Table 1. Residual dipolar couplings for bound and free AMM 



Data Set 



C,-H, (Hz) 



<VH 2 (Hz) 



AMM + MBP + bicelle 
AMM + bicelle (D^) 
AMM-bound state (D h , HtfvJ ) 



C 3 -H, (Hz) 



C 4 -H< (Hz) 



-10.8±0.7 
-3.5±0.7 
-22.6±2.2 



-15.2±07 
-7.3±0.7 
-28.7±2.2 



13,3±1.2 
9.2±0.9 
20±33 



16.6±1 

12.1±0.7 

23.9±2.9 



C r H 5 (Hz) 



16.4 ±1.1 
7.8±05 
3O5±3.0 




While residual dipolar coupling contributions 
are indeed observed in the AMM/ MBP complex 
dissolved in a bicelle medium, they need not orig- 
>;inate fuBy from binding of AMM to a strongly 
^lUj^ MBP molecule. We see only a single set of 
i^^iKesrbut for a weak binding ligand in fast 
^^uitte that spends a portion of its time bound 
MS^Itefwten and the remainder free in solution, 



m 



if^thls^wul always be the case. Thus, observed split- 
tings will be a population weighted average of 
those induced in the free and bound states. 
Residual dipolar couplings have previously been 
observed in free carbohydrate molecules dissolved 
in bicelle media (Bolon & Prestegard, 1998; Kiddle 
& Homans, 1998; Rundlof et al, 1998). While small, 
they can be significant. In order to quantify the 
|^cimtribution from the AMM-free state, the same 
^experiments were repeated under identical con- 
ditions in the absence of MBP. The differences in 
measured couplings between 25 °C and 39 °C are 
also shown in Table 1 (D^. Again, couplings and 
associated errors were extracted using a Bayesian 
parameter estimation method. It is noteworthy that 
the magnitudes of couplings for various C-H bond 
vectors are generally smaller for free AMM 
•^m^-b. samples, but the reduction varies from site to site. 
§fll§|p[he different pattern of spHrtings assures us that 
we are' not looking at a simple scaling of bicelle- 
induced order and that MBP has a significant 
direct effect on both extent and direction of AMM 
alignment. This direct binding effect was con- 
firmed in another set of experiments. Here, a molar 
ratio of 1:8 MBP: AMM was used, which would 
reduce the bound AMM state to approximately 
8%. Indeed, in this case where indirect effects 
would dominate, the presence of MBP had little 
measurable effect on the observed residual dipolar 
couplings. 

Interpretation of residual dipolar contributions 
1 of a ligand in a protein-bound state will, in gener- 
|al, require separation of the contribution from the 
free state using known binding properties. The 
I observed residual dipolar couplings measured in 
the AMM/MBP complex (Table 1, D^) will be a 
population weighted average from the free (N^) 
and bound (Mbo^) states, such that: 



2^>. ■ 

% 



Dobs = NfreeDfree + Abound Abound 



(1) 



fraction of AMM free and bound (N^ and 
i) can be calculated from the dissociation con- 
and known protein-ligand concentrations, 
a previously determined binding constant 
| ? 1 mM (Sayers, 1998), approximately 40% of 
|is in fact bound to MBP leaving approxir 



mately 60% of AMM free in solution. Residua! 
dipolar couplings originating from the bound state 
( D bound) calculated from equation' (I) are shown in 
Table 1. Since these dipcSf^^^SSSLte 
from MBPs pan^^^uy^^|»»i^^ 
desired orientationaTinfb^ 
to MBP. This iruonnatibri tiWtM^Wrtk an 
order matrix analysis^ -^ 'f 3fe®S ^ 



Order tensor calculation In AMM 

Using AMBER-iriinimized structure coordinates 
for AMM (Woods et al, 1995) and five residual 
dipolar couplings measured in the sugar ring in 
simple bicelle solution, the five elements of the 
order matrix were determined using a singular 
value decomposition approach (Losonczi et al, 
1999). The initial molecular coordinate frame for 
AMM was defined by placing the y-axis along the 
Cl-Hl vector and the z-axis normal to the plane 
defined by Hl-Cl-OMe. Using this frame, the pos- 
ition of the principal order frame axes determined 
from residual dipolar couplings from free AMM 
(^free/ Table 1) is illustrated in Figure 2(a) using a 
Sauson-Haumsteed projection (Bugayevskiy & 
Snyder, 1995). The alignment tensor is extremely 
asymmetric and the directions of all three principal 
axes of the alignment tensor are well defined, 
except that inversion of any axis is allowed. The 
asymmetry is quantitatively defined by an asym- 
metry parameter eta (n) that can be written in 
terms of order parameters S i; (n = (S^ - S^/S^). 
It is equal to 0.8(±0.13). Asymmetric behavior is 
not unusual and has, in fact, been observed in 
many systems dissolved in bicelle media. Due to 
the high level of asymmetry, the direction of high- 
est order (SJ can alternate between two orthog- 
onal orientations (Figure 2(a)) for which 
assumes values similar in magnitude but opposite 
sign (S^ = + 0.0005 or -0.0005). Note that one of 
the possibilities for the higher level of order is 
nearly along the x-axis of the initial molecular 
coordinate frame (the center of the Sauson-Flaum- 
steed plot). Hence, it is nearly in the plane of the 
Hl-Cl-OMe fragment and perpendicular to the 
C-H vector. 

The five elements of the order tensor for AMM 
detennined from couplings for the MBP-bound 
state (0^^) w ere also determined using a singu- 
lar value decomposition method, and the direc- 
tions of the principal axes are depicted in 
Figure 2(b). While the position of the z-axis, the 
most strongly ordered axis, is well defined, the 
position of the and Syy axes are ambiguous. 
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1999). However, careful consideration of MBP's 
structure, and in particular, its symmetry, allows 
us to omit this independent interpretation and 
indeed explain the appearance of axial symmetry 
depicted in Figure 2(b). 

MBP's alignment tensor: a solution 
In symmetry 

The form of MBP^used >.in .these studies is a 
homo-*rimer wit^^ and 




Figure 2. (a) Sauson-Flaumsteed projection of the 
directions of highest order for oriented AMM without 
MBP. (b) Sauson-Flaumsteed projection of the directions 
of highest order for oriented AMM in the presence of 
MBP, with a clear depiction of axial symmetry. The 
molecular coordinate frame for AMM was defined by 
placing the y-axis along the Cl-Hl vector and the z-axis 
normal to the plane denned by Hl-Cl-OMe. 



This is a characteristic of an axially symmetric 
order tensor. The asymmetry parameters confirms 
this axially symmetric nature of the alignment ten- 
sor with T| = 0.24(±0.16). In addition, the level of 
order (S M = - 0.0012(±0.00012)) is approximately 
2.5 times larger than that for AMM in free solution, 
and the direction of highest order is shifted 40° 
toward the molecular z-axis (out of the plane of to 
the Hl-Cl-OMe fragment). 

The direction of alignment for the bound ligand 
is most useful when referenced to the alignment of 
MBP. We know that the direction of highest order 
(Szz) depicted in Figure 2(b) should coincide with 
the direction of highest order for MBP itself. Rotat- 
ing the molecular frame of the sugar to achieve 
coincidence then allows determination of the rela- 
tive orientation of AMM bound to MBP. In general, 
deterrnination of the alignment from a protein is 
possible using *H- 15 N data as we have recently 
demonstrated for other systems (Fischer et aL; 



V the AMM in thTE^itate (' 
be an average over time, — 



will 

v ^ _ that are 
related by r MBP's Wold 'syi^etry axis. As has 
long been recognized and used in studies of small 
molecules (Saupe, 1968) this averaging has a very 
specific effect on order matrices determined from 
averaged couplings; regardless of the nature of 
MBPs interaction with the bicelle medium, AMM's 
experimentally determined direction of highest 
order must always point along the symmetry axis 
of MBP and the resulting order tensor must always 
be axially symmetric (Al-Hashimi et al., 1999). 

In the absence of any large conformational flexi- 
bility and mobility in the bound state, the order 
parameters determined from bound AMM (S^ = 
- 0.0012(±0.00012)) will reflect MBP's order par- 
ameter. The fact that the principal order parameter 
is negative indicates that MBP's symmetry axis is, 
on average, perpendicular to the magnetic field 
and parallel with the normals of the bicelles in our 




Figure 3. Depiction of AMM in fast exchange with the 
three equivalent CRDs of MBP. The 3-fold axial sym- 
metry results in a direction of highest order along the 
rotor axis. Yellow spheres correspond to Ca 2+ , black 
and red shperes to carbon and oxygen, respectively, of 
AMM, and MBP is represented by ribbon diagram. 
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medium (these order with their normals perpen- 
dicular to the magnetic field; Sanders et ai, 1994). 
This is as one would expect if interactions with the 
surfaces of bicelles dictated order of MBP. Axial 
symmetry should also propagate to the bound 
ligand and this is exactly what we observe in the 
order tensor determination using residual dipolar 
contributions from MBP bound AMM as shown in 
Figure 2(b). The direction of Highest order, in 
Figure 2(b) should iwwip^tfra symmetry 
bios of MBP; As >ve discuss below;" the orientation 



order will vary. A fourth possibility is that the 
binding geometry in the primary site can vary. 

In Figure 4 we show a geometry of AMM in the 
binding site of MBP (lKMB.pdb) that is consistent 
with our determined axis of symmetry. Using this 
refined structure, s-methyl mannose was docked 
to the binding calcium of the CRD of MBP. Orien- 
tational constraints required placing AMM's exper- 
imentally derived direction of highest order along 
MBFs? 3-fold s aod*^ of ^Subseouentlv, 




y, and a^lv^uTe 






^distance from 



— nmm& 

a^mm^ifmmomw ^alldwed van: der 

^ j^f^oyj^vr *!^^j**i£V 




i^gS^^^ between HO-3 and 

Glut 93 of MBP. The fimcScT forarused in our 



of trinl^^Kffi with 
AMM in the binding site actually exists. However, 
AMM in the site of a dimer of the homologous 
MPB-C, missing the helical stem exists, and AMM 
in the trimer can be positioned by overlaying a 
carbohydrate-binding domain from the dimer on 
the three domains in the trimer structure (PDB 
entries 1KMB and 1RDL, rmsd 0.63-0.78 A). The 
orientation of AMM, when positioned so that the 
experimentally determined symmetry axes coincide 
with that of MBP cannot, however, be made to 
overlap with the orientation of AMM modeled in 
this way. It is displaced by approximately 40°. 
There are several possible explanations for this 
observed deviation. First, our use of the trimer as a 
model may be in error. Instead, an equilibrium 
among monomer, dimer and trimer forms may 
exist in solution. This possibility is unlikely, since 
line widths and diffusional properties of the con- 
struct used in previous studies are consistent with 
a trimer (Sayers, 1998). Moreover, the experimen- 
tally determined axially symmetric tensor is con- 
sistent with the expected 3-fold averaging. Second, 
the actual geometry of the trimer in solution may 
be different from what is depicted in the crystal 
structure. This is not out of the question; there are 
in fact X-ray structures of mutated MBPs in tri- 
meric aggregates which show substantial differ- 
ences in the relative placement of the CRDs, even 
though mutations are concentrated in loops at the 
extremities. For example, the helices near the tri- 
mer junction in PDB entries 1KMB and 1AFA are 
displaced from one another by more than a 1 A 
IPI when the stems are aligned. Similarly, the presence 
^^^,of internal motion in each protein domain or in 
"Hfi^^k* m me bound state can lead to deviations of 
^f? ge orientati o n s- Third, it may be that alternate 
ing geometries are possible. A second binding 
Evolving a second Ca 2+ , in each CRD has, in 
^been observed in crystals containing a high 
saccharide (Ng et aL, 1996). If this site were 



modeling has no ligand of AMM in the binding 
site and Glul93 coordination of the calcium com- 
pensates for the absence of sugar hydroxy 1. Hence, 
it is quite likely that Glul93 moves on sugar bind- 
ing. MBP actually has a rather open binding site, 
in which coordination of the three and four oxygen 
atoms of the sugar to Ca 2+ are the primary inter- 
actions and considerable variation in the actual 
geometry of coordination may be possible. 

An independent deterrnination of MBP's order 
tensor would remove some of the uncertainties in 
the above discussion. First, this would allow deter- 
mination of the relative domain orientations of 
MBP in solution. This can be achieved by equating 
the observed direction of highest order as seen 
from any single domain to the symmetry axis of 
trimeric MBP (Al-Hashimi et al, 1999). This pro 
cedure, in fact, introduces a novel and simple route 
into determination of relative domain orientations 
in certain types of symmetry related homo-mul ti- 
mers such as MBP, a problem that so far has been 



(Ng et al, 1996) 
aproper model would have to include 
mJ** one ntation of the sugar in this 
!tB»t the second and third cases would 
symmetric nature of the deter- 
but the direction of highest 




Figure 4. Proposed mode of binding of AMM in MBP. 
The dark blue corresponds to Ca 2+ , and the light blue 
represents residues in the binding pocket of MBP. 
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very difficult to address using standard NMR tech- 
niques. Second, the determined direction of highest 
order «nd the co r re sp onding order parameters will 
havetibsorbed any internal motional averaging 
contribution* within MBP permitting direct com- 
rSorf with ^WSminoT ieiermined for AMM 



diacetylated a-methyl mannoside. Flash chromatography 
followed by doocctytaHon with NaOMe/MeOH pro- 
vided AMM In an 20% overall yield. 



NMR ip#ctroiCOpy 





Two *»mpfca were prepared: (1) 



1 mM MBI'A, 
25 mM CaO, 

"~ Mi 

¥5 




^^^^^^^S^^^mmmmmm^-^ accomplished 



-ft: 



III 



Preparation of MBP-A 

^ MBP-A is an expression product from the pIN-IUom- 
pA-2 plasmid (Drickamer, 1989), and was expressed as 
described (Weis et a/., 1991). Briefly, competent JA221 
cells were transformed with DNA plasmid and colonies 
selected from LB-amp plates for use in a starter culture 
in an LB-amp medium- Starter culture was grown to sat- 
uration overnight at 37 °C and used to inoculate five 
liters of LB-amp medium, which grew to of 0.8. 
MBP-A production was then induced with IPTG and 
additional growth allowed for 2.5 hours. Cells were har- 
vested by centrifugation at 4000 g, the pellet was resus- 
pended in 10 mM Tris-Cl (pH 7.0) and then lysed by 
sonication. The insoluble pellet from 18,000 g centrifu- 
gation was solubulized in 6 M guanidinium chloride, 
0.1 M Tris-Cl (pH 7.0), and clarified by centrifuging at 
138,000 g for 30 minutes at 4°C. Dialysis of the centri- 
fuge extract with 25 mM Tris-Cl, 1.25 M NaCl, 25 mM 
CaCl 2 over 48 hours and centrifuging at 138,000 g pro- 
vided the dialysate which was purified by affinity chro- 
matography on a mannose-Sepharose column. Elution 
with 25 mM Tris-Cl, 1.25 M NaCl and 2.5 mM Na 2 EDTA 
followed by reconstitution by dialysis against 10 mM 
NaCl, 1 mM Tris-Cl, 25 mM CaCl 2 afforded 10 mg of 
MBP-A from a five liter growth culture. 



Preparation of AMM 

AMM was synthesized from [ 13 C 6 ] glucose by minor 
modification of a described method (Hare et a/., 1993). 
Briefly, uniformly l3 C-labeled r>glucose was protected as 
the penta-acetate with acetic anhydride, and the crude 
material brominated with HBr/AcOH. Reduction with 
Zn/ aqueous AcOH provided the desired tri-O-acetyl-D- 
glucal, which was used in a Ferrier reaction with metha- 
nol /Do wex H + to give (ot-methyl-2,3-dideoxy-4,6-di-0- 
acetyl-D-glucopyranoside. Purification by silica gel 
chromatography; followed by treatment of the anhyd- 
rous material with Os0 4 in pyridine afforded the desired 



maximal vallgnment 

wete ~ acquired f using normal 
single quantum coherence 
so ; iftatS??C couplings 
direct ; jwo*Mi dimension. Quadrature 

using gradienf coherence selection. For all experiments, 
1024 points per scan, 16 scans per increment, 160 1 } incre- 
ments were acquired with a direct sweep width of 
3000 Hz and an indirect sweep width of 9000 Hz. Using 
the same acquisition parameters, another identical HSQC 
experiment was acquired at a temperature of 25 °C 
where the bicelie medium is in an isotropic solution 
state. 



Calculations of dipolar couplings and order tensors 

Dipolar couplings were calculated as the difference 
between the oriented couplings (Vch + ^ch) the iso- 
tropic couplings (Vch)* m eacn case / couplings were 
extracted using a Bayesian time-domain NMR parameter 
estimation program Xrambo, using the method described 
(Andrec & Prestegard, 1998). This program "is available 
on the Internet at http://tesla.ccrc.uga.edu. Typically, a 
2D HSQC data set with the desired splittings in the 
direct dimension was transformed and phased to yield a 
1024 x 256 real matrix. Single FID slices across the width 
of the resonance under investigation were analyzed inde- 
pendently. For every slice, the resulting frequency 
domain data were reverse Fourier transformed to gener- 
ate a ID time domain data set as input for Xrambo. The 
following model was used for the data. Each component 
of the doublet resulting from C-H couplings was given 
an identical linewidth and intensity, but an independent 
phase to circumvent any problems resulting from the 
presence of phase twist anomalies or dispersive contri- 
butions. Values for shifts, linewidths, phases and intensi- 
ties were estimated and entered as starting parameters 
which were subsequently refined by Xrambo's Metropo- 
lis Monte Carlo method. This procedure was repeated 
for several indirect slices of the same doublet and the 
resulting rmsd across various slices was used as the 
uncertainty in measurement. The error analysis from this 
procedure was used to estimate the final precision of all 
measured residual dipolar couplings. The measured 
residual dipolar couplings and associated uncertainties 
along with AMM input coordinates obtained from an 
MD simulation were then used as input to a singular 
value decomposition program for the determination of 
order tensor elements (Losonczi et al. f 1999). This 
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program is also available on the internet at http:// 
tesla.ccrc.uga.edu. 

Molecular dynamics simulation 

MD simulations were performed with AMBER 
4.1 (Pearlman et al.. 1995) employing the all-atom 
CLYCAM93 parameter set for oligosaccharides (Woods 
et al., 1995) on a 12 processor SGI Origin 2000 comp- 
uter.^ AMM- was solvated in an approximately 
32 A^i 32 A x 32 A box of 466 TTP3P water molecules 
^^gj^^^Sj^Bd by steepest descent energy minimiz- 
M^WSM&^Ktxk constant of IJO and a^cut-off 
^^m^rmiSded pair mtectioniH&f 8.0 A^^mWal^ 



at. 5 K, and the 'simu-; 
fitl^ nm?at^300 K - at 1 atm. Selected torsions ; were 

CARNAL module of AMBER 4.1 from 
^W^aa^t^ ^era^ a closely related family of 
the most jpbpulated structures. 

Docking studies 

dj^^^o^ations of AMM's position in MBP were 
^MkyaDJ^l^ (Ferrin et al., 1988). Briefly, MBP 
vsdm AMM modeled into the CRD were aligned with the 
helix parallel with the z-axis. A second AMM was 
matched; and oriented according to the singular value 
decom p ration analysis of the calculated dipolar coup- 
lings (a z - 10°, y + 40° rotation from the crystal struc- 
ture). This was then translated in x, y, and z directions 
while limiting rotations to the z-axis only. Solutions that 
maintained the three and four OH groups 2.6-2.8 A from 
the Ca +2 responsible for binding and that minimized van 
der Waals interaction with the protein were deemed 
plausible solutions. ~ 
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